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Description 

The present Invention relates to cloning of immunoglobulin (Ig) variable domain sequences. Methods for 
cloning, amplifying and expressing DNA sequences encoding at least part of an immunoglobuiin variable 
5 domain and metliods for the use of said DNA sequences in the production of Ig-type molecules are 
disclosed. 

A list of references is appended to the end of the description. The documents listed therein are referred 
to in the description by number, which is given in square brackets []. 

The Ig superfamily includes not only the Igs themselves but also such molecules as receptors on 

JO lymphoid ceils such as T lymphocytes. Immunoglobulins comprise at least one heavy and one light chain 
covalently bonded together. Each chain is divided into a number of domains. At the N tenminal end of each 
chain is a variable domain. The variable domains on the heavy and light chains fit together to fonm a 
binding site designed to receive a particular target molecule. In the case of Igs, the target molecules are 
antigens. T-cell receptors have two chains of equal size, the a and j3 chains, each consisting of two 

76 domains. At the N-terminal end of each chain is a variable domain and the variable domains on the a and fi 
chains are believed to fit together to form a binding site for target molecules, in this case peptides 
presented by a histocompatibility antigen. The variable domains are so called because their amino acid 
sequences vary particularly from one molecule to another. This variation in sequence enables the molecules 
to recognise an extremely wide variety of target molecules. 

20 Much research has been carried out on Ig molecules to determine how the variable domains are 
produced. It has been shown that each variable domain comprises a number of areas of relatively 
conserved sequence and three areas of hypervariable sequence. The three hypervariable areas are 
generally known as complementarity determining regions (CDRs). 

Crystallographic studies have shown that in each variable domain of an Ig molecule, the CDRs are 

25 supported on framework areas formed by the areas of conserved sequences. The three CDRs are brought 
together by the framework areas and. together with the CDRs on the other chain, form a pocket in which the 
target molecule is received. 

Since the advent of recombinant DNA technology, there has been much interest In the use of such 
technology to clone and express Ig molecules and derivatives thereof. This interest is reflected In the 

30 numbers of patent applications and other publications on the subject. 

The earliest work on the cloning and expression of full Igs in the patent literature is EP-A-0 120 694 
(Boss). The Boss application also relates to the cloning and expression of chimeric antibodies. Chimeric 
antibodies are Ig-type molecules In which the variable domains from one Ig are fused to constant domains 
from another Ig. Usually, the variable domains are derived from an Ig from one species (often a mouse Ig) 

35 and the constant domains are derived from an Ig from a different species (often a human Ig). 

A later European patent application, EP-A-0 125 023 (Genentech), relates to much the same subject as 
the Boss application, but also relates to the production by recombinant DNA technology of other variations 
of Ig-type molecules. 

EP-A-0 194 276 (Neuh>erger) discloses not only chimeric antibodies of the type disclosed in the Boss 

40 application but also chimeric antibodies in which some or all of the constant domains have been replaced 
by non-lg derived protein sequences. For instance, the heavy chain CH2 and CH3 domains may be 
replaced by protein sequences derived from an enzyme or a protein toxin. 

EP-A-0 239 400 (Winter) discloses a different approach to the production of Ig molecules. In this 
approach, only the CDRs from a first type of Ig are grafted onto a second type of Ig in place of its normal 

45 CDRs. The Ig molecule thus produced is predominantly of the second type, since the CDRs form a 
relatively small part of the whole Ig. However, since the CDRs are the parts which define the specificity of 
the Ig, the Ig molecule thus produced has its specificity derived from the first Ig. 

Hereinafter, chimeric antibodies, CDR-grafted Igs. the altered antibodies described by Genentech. and 
fragments, of such Igs such as F(ab02 and Fv fragments are referred to herein as modified antibodies. 

50 One of the main reasons for all the activity in the Ig field using recombinant DNA technology is the 
desire to use Igs in therapy. It is well known that, using the hybrldoma technique developed by Kohler and 
Milstein. it is possible to produce monoclonal antibodies (MAbs) of almost any specificity. Thus, MAbs 
directed against cancer antigens have been produced. It is envisaged that these MAbs could be covalently 
attached or fused to toxins to provide "magic bullets" for use In cancer therapy. MAbs directed against 

55 normal tissue or cell surface antigens have also been produced. Labels can be attached to these so that 
they can be used for in vivo imaging. 

The major obstacle to the use of such MAbs in therapy or in vivo diagnosis is that the vast majority of 
MAbs which are produced are of rodent. In particular mouse, origin. It is very difficult to produce human 
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MAbs. Since most MAbs are derived from non-human species, they are antigenic in humans. Thus, 
administration of these MAbs to humans generally results in an anti-lg response being mounted by the 
human. Such a response can interfere with therapy or diagnosis, for instance by destroying or clearing the 
antibody quickly, or can cause allergic reactions or Immune complex hypersensitivity which has adverse 
5 effects on the patient. 

The production of modified tgs has been proposed to ensure that the tg administered to a patient is as 
"human" as possible, but still retains the appropriate specificity, ft is therefore expected that modified Igs 
will be as effective as the MAb from which the sp)ecificity is derived but at the same time not very antigenic. 
Thus, it should be possible to use the modified Ig a reasonable number of times in a treatment or diagnosis 
10 regime. 

At the level of the gene, it is known that heavy chain variable domains are encoded by a **rearranged" 
gene which is built from three gene segments: an "unrearranged" VH gene (encoding the N-termlnal three 
framework regions, first two complete CDRs and the first part of the third CDR), a diversity (DH)-segment 
(DH) (encoding the central portion of the third CDR) and a joining segment (JH) (encoding the last part of 
75 the third CDR and the fourth framework region). In the maturation of B-cells. the genes rearrange so that 
each unrearranged VH gene is linked to one DH gene and one JH gene. The rearranged gene corresponds 
to VH-DH-JH. This rearranged gene is linked to a gene which encodes the constant portion of the Ig chain. 

For light chains, the situation Is similar, except that for light chains there is no diversity region. Thus 
light chain variable domains are encoded by an "unrearranged" VL gene and a JL gene. There are two 
20 types of light chains, kappa (x) or lambda (k), which are built respectively from unrearranged Vx genes and 
Jx segments, and from unrearranged VX genes and JX segments. 

It has been discovered that isolated Ig heavy chain variable domains can bind to antigen in a 1:1 ratio 
and with binding constants of equivalent magnitude to those of complete antibody molecules. 

Single domain iigands consisting of at least part of the variable domain of one chain of a molecule from 
25 the Ig superfamily may be the end product of processes involving methods according to the present ' 
Invention. 

Preferably, each ligand consists of the variable domain of an Ig light, or, most preferably, heavy chain. 
If desired, a gene for a single domain ligand can be mutated to improve the properties of the expressed 
domain, for example to increase the yields of expression or the solubility of the ligand, to enable the ligand 

30 to bind better, or to introduce a second site for covalent attachment (by introducing chemically reactive 
residues such as cysteine and histidine) or non-covalent binding of other molecules. In particular it would be 
desirable to introduce a second site for binding to serum components, to prolong the residence time of the 
domains in the serum; or for binding to molecules with effector functions, such as components of 
complement, or receptors on the surfaces of cells. 

35 Thus, hydrophobic residues which would normally be at the interface of the heavy chain variable 
domain with the light chain variable domain could be mutated to more hydrophllic residues to improve 
solubility: residues in the CDR loops could be mutated to Improve antigen binding; residues on the other 
loops or parts of the i^-sheet could be mutated to introduce new binding activities. Mutations could Include 
single point mutations, multiple point mutations or more extensive changes and could be introduced by any 

40 of a variety of recombinant DNA methods, for example gene synthesis, site directed mutagenesis or the 
polymerase chain reaction. 

Since these Iigands have equivalent binding affinity to that of complete Ig molecules, the Iigands can be 
used in many of the ways as are Ig molecules or fragments. For example, Ig molecules have been used In 
therapy (such as in treating cancer, bacterial and viral diseases), In diagnosis (such as pregnancy testing). 

45 in vaccination (such as in producing anti-idlotypic antibodies which mimic antigens), in modulation of 
activities of hormones or growth factors, in detection, in biosensors and in catalysis. 

It is envisaged that the small size of the Iigands may confer some advantages over complete 
antibodies, for example, in neutralising the activity of low molecular weight drugs (such as digoxin) and 
allowing their filtration from the kidneys with drug attached; in penetrating tissues and tumours; in 

so neutralising viruses by binding to small conserved regions on the surfaces of viruses such as the "canyon" 
sites of viruses [16]; In high resolution epitope mapping of proteins; and in vaccination by Iigands which 
mimic antigens. 

A single domain ligand may be linked to one or more of an effector molecule, a label, a surface, or one 
or more other Iigands having the same or different specificity, forming a "receptor". 
55 A receptor comprising a ligand linked to an effector molecule may be of use in therapy. The effector 
molecule may be a toxin, such as ricin or pseudomonas exotoxin, an enzyme which is able to activate a 
prodrug, a binding partner or a radio-isotope. The radio-isotope may be directly linked to the ligand or may 
be attached thereto by a chelating structure which is directly linked to the ligand. Such Iigands with 


4 


EP 0 368 684 B1 


attached isotopes are much smaller than those based on Fv fragments, and could penetrate tissues and 
access tumours more readily. 

A receptor comprising a ligand linked to a label may be of use in diagnosis. The label may be a heavy 
metal atom or a radio-isotope, in which case the receptor can be used for in vivo imaging using X-ray or 

5 other scanning apparatus. The metal atom or radio-isotope may be attached to the ligand either directly or 
via a chelating structure directly linked to the ligand. For in vitro diagnostic testing, the label may be a 
heavy metal atom, a radio-isotope, an enzyme, a fluorescent or coloured molecule or a protein or peptide 
tag which can be detected by an antibody, an antibody fragment or another protein. Such receptors would 
be used in any of the known diagnostic tests, such as ELISA or fluorescence-linked assays. 

70 A receptor comprising a ligand linked to a surface, such as a chromatography medium, could be used 
for purification of other molecules by affinity chromatography. Linking of ligands to cells, for example to the 
outer membrane proteins of E. coli or to hydrophobic tails which localise the ligands in the cell membranes, 
could allow a simple diagnostic test in which the bacteria or cells would agglutinate in the presence of 
molecules bearing multiple sites for binding the ligand(s). 

75 Receptors comprising at least two ligands can be used, for instance, in diagnostic tests. The first ligand 
will bind to a test antigen and the second ligand will bind to a reporter molecule, such as an enzyme, a 
fluorescent dye. a coloured dye, a radio-isotope or a coloured-, fluorescently- or radio-labelled protein. 

Alternatively, such receptors may be useful in increasing the binding to an antigen. The first ligand will 
bind to a first epitope of the antigen and the second ligand will bind to a second epitope. Such receptors 

20 may also be used for increasing the affinity and specificity of binding to different antigens in close proximity 
on the surface of cells. The first ligand will bind to the first antigen and the second epitope to the second 
antigen: strong binding will depend on the co-expression of the epitopes on the surface of the cell. This 
may be useful in therapy of tumours, which can have elevated expression of several surface markers. 
Further ligands could be added to further Improve binding or specificity. Moreover, the use of strings of 

25 ligands. with the same or multiple specificities, creates a larger molecule which is less readily filtered from 
the circulation by the kidney. 

For vaccination with ligands which mimic antigens, the use of strings of ligands may prove more 
effective than single ligands, due to repetition of the immunising epitopes. 

If desired, such receptors with multiple ligands could include effector molecules or labels so that they 

30 can be used in therapy or diagnosis as described above. 

The ligand may be linked to the other part of the receptor by any suitable means, for instance by 
covalent or non-covalent chemical linkages. However, where the receptor comprises a ligand and another 
protein molecule, rt is preferred that they are produced by recombinant DMA technology as a fusion 
product. If necessary, a linker peptide sequence can be placed between the ligand and the other protein 

35 molecule to provide flexibility. 

The basic techniques for manipulating Ig molecules by recombinant DNA technology are described in 
the patent references cited above. These may be adapted in order to allow for the production of single 
domain ligands and receptors by means of recombinant DNA technology. 

Preferably, where the ligand is to be used for in vivo diagnosis or therapy in humans, it is humanised. 

40 for instance by CDR replacement as described in EP-A-0 239 400. 

In order to obtain a DNA sequence encoding a ligand, it is generally necessary firstly to produce a 
hybridoma which secretes an appropriate MAb. This can be a very time consuming method. Once an 
immunised animal has been produced, it is necessary to fuse separated spleen cells with a suitable 
myeloma cell line, grow up the cell lines thus produced, select appropriate lines, redone the selected lines 

45 and reselect. This can take some long time. This problem also applies to the production of modified Igs. 

A further problem with the production of ligands. and also receptors described above and modified Igs, 
by recombinant DNA technology is the cloning of the variable domain encoding sequences from the 
hybridoma which produces the MAb from which the specificity is to be derived. This can be a relatively 
long method involving the production of a suitable probe, construction of a clone library from cDNA or 

50 genomic DNA, extensive probing of the clone library, and manipulation of any isolated clones to enable the 
cloning into a suitable expression vector. Due to the inherent variability of the DNA sequences encoding Ig 
variable domains, it has not previously been possible to avoid such time consuming wori<. It is therefore a 
further aim of the present invention to provide a method which enables substantially any sequence 
encoding an ig superfamily molecule variable domain (ligand) to be cloned in a reasonable period of time. 

55 According to an aspect of the present invention therefore, there is provided a method of cloning a 
sequence (the target sequence) which encodes at least part of the variable domain of an Ig superfamily 
molecule, which method comprises: 

(a) providing a sample of double stranded (ds) nucleic acid which contains the target sequence; 
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(b) denaturing the sample so as to separate the two strands; 

(c) annealing to the sample a forward and a back oligonucleotide primer, the iomard primer being 
specific for a sequence at or adjacent the 3' end of the sense strand of the target sequence, the back 
primer being specific for a sequence at or adjacent the 3* end of the antisense strand of the target 

5 sequence, under conditions which allow the primers to hybridise to the nucleic acid at or adjacent the 
target sequence; 

(d) treating the annealed sample with a DNA polymerase enzyme In the presence of deoxynucleosrde 

triphosphates under conditions which cause primer extension to take place; and 

(e) denaturing the sample under conditions such that the extended primers become separated from the 
10 target sequence. 

Preferably, the method of the present invention further includes the step (f) of repeating steps (c) to (e) 
on the denatured mixture a plurality of times. 

Preferably, the method of the present invention is used to clone complete variable domains from Ig 
molecules, most preferably from Ig heavy chains. 

75 In step (c) recited above, the forward primer becomes annealed to the sense strand of the target 
sequence at or adjacent the 3' end of the strand. In a similar manner, the back primer becomes annealed to 
the antisense strand of the target sequence at or adjacent the 3* end of the strand. Thus, the forward primer 
anneals at or adjacent the region of the ds nucleic acid which encodes the C terminal end of the variable 
region or domain. Similariy. the back primer anneals at or adjacent the region of the ds nucleic acid which 

20 encodes the N-terminal end of the variable domain. 

In step (d), nucleotides are added onto the 3* end of the forward and back primers in accordance with 
the sequence of the strand to which they are annealed. Primer extension will continue In this manner until 
stopped by the beginning of the denaturing step (e). It must therefore be ensured that step (d) is carried out 
for a long enough time to ensure that the primers are extended so that the extended strands totally overlap 

25 one another. 

In step (e), the extended primers are separated from the ds nucleic acid. The ds nucleic acid can then 
serve again as a substrate to which further primers can anneal. Moreover, the extended primers themselves 
have the necessary complementary sequences to enable the primers to anneal thereto. 

During further cycles, if step (f) is used, the amount of extended primers will increase exponentially so 
30 that at the end of the cycles there will be a large quantity of cDNA having sequences complementary to the 
sense and antisense strands of the target sequence. Thus, the method of the present invention will result in 
the accumulation of a large, quantity of cDNA which can form ds cDNA encoding at least part of the variable 
domain. 

As will be apparent to the skilled person, some of the steps in the method may be carried out 

35 simultaneously or sequentially as desired. 

The forward and back primers may be provided as isolated oligonucleotides, In which case only two 
oligonucleotides will be used. However, alternatively the forward and back primers may each be supplied as 
a mixture of closely related oligonucleotides. For instance, it may be found that at a particular point in the 
sequence to which the primer is to anneal, there Is the possibility of nucleotide variation. In this case a 

40 primer may be used for each possible nucleotide variation. Furthermore it may be possible to use two or 
more sets of "nested" primers in the method to enhance the specific cloning of variable region genes. 

The method described above is similar to the method described by Saiki et al. [17]. A similar method is 
also used in the methods described in EP-A-0 200 362. In both cases the method described is can'ied out 
using primers which are known to anneal efficiently to the specified nucleotide sequence. In neither of these 

45 disclosures was it suggested that the method could be used to clone Ig parts of variable domain encoding 
sequences, where the target sequence contains inherently highly variable areas. 

The ds nucleic acid sequence used in the method of the present invention may be derived from mRNA. 
For instance. RNA may be isolated in known manner from a cell or cell line which is known to produce Igs. 
mRNA may be separated from other RNA by oligo-dT chromatography. A complementary strand of cDNA 

50 may then be synthesised on the mRNA template, using reverse transcriptase and a suitable primer, to yield 
an RNA/ONA heteroduplex. A second strand of DNA can be made in one of several ways, for example, by 
priming with RNA fragments of the mRNA strand (made by incubating RNA/DNA heteroduplex with RNase 
H) and using DNA polymerase, or by priming with a synthetic oligodeoxynucleotide primer which anneals to 
the y end of the first strand and using DNA polymerase. It has been found that the method of the present 

55 Invention can be carried out using ds cDNA prepared in this way. 

When making such ds cDNA. it Is possible to use a forward primer which anneals to a sequence in the 
CHI domain (for a heavy chain variable domain) or the CX or C* domain (for a light chain variable domain). 
These will be located in close enough proximity to the target sequence to allow the sequence to be cloned. 
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The back primer may be one which anneals to a sequence at the N-terminal end of the VH1, Vx or VX 
domain. The back primer may consist of a plurality of primers having a variety of sequences designed to be 
complementary to the various families of VH1, Vx or VX sequences known. Alternatively the back primer 
may be a single primer having a consensus sequence derived from all the families of variable region genes. 
5 Surprisingly, it has been found that the method of the present invention can be carried out using 
genomic DNA. If genomic DNA is used, there Is a very large amount of DNA present, including actual 
coding sequences, Introns and untranslated sequences between genes. Thus, there is considerable scope 
for non-specific annealing under the conditions used. However, it has surprisingly been found that there is 
very little non-specific annealing. It is therefore unexpected that it has proved possible to clone the genes of 
70 Ig-variable domains from genomic DNA. 

Under some circumstances the use of genomic DNA may prove advantageous compared with use of 
mRNA, as the mRNA is readily degraded, and especially difficult to prepare from clinical samples of human 
tissue. 

Thus, in accordance with an aspect of the present Invention, the ds nucleic acid used in step (a) is 
15 genomic DNA. 

When using genomic DNA as the ds nucleic acid source, it wilt not be possible to use as the forward 
primer an oligonucleotide having a sequence complementary to a sequence in a constant domain. This Is 
because, in genomic DNA, the constant domain genes are generally separated from the variable domain 
genes by a considerable number of base pairs. Thus, the site of annealing would be too remote from the 

20 sequence to be cloned. 

It should be noted that the method of the present Invention can be used to clone both rearranged and 
unrearranged variable domain sequences from genomic DNA. It is known that in germ line genomic DNA 
the three genes, encoding the VH, DH and JH respectively, are separated from one another by considerable 
numbers of base pairs. On maturation of the Immune response, these genes are rearranged so that the VH, 

25 DH and JH geries are fused together to provide the gene encoding the whole variable domain (see Figure 
1). By using a forward primer specific for a sequence at or adjacent the 3' end of the sense strand of the 
genomic "unrearranged" VH gene, It Is possible to clone the "unrearranged" VH gene alone, without also 
cloning the DH and JH genes. This can be of use in that it will then be possible to fuse the VH gene onto 
pre-cloned or synthetic DH and DH genes. In this way, rearrangement of the variable domain genes can be 

30 carried out in vitro. 

The oligonucleotide primers used In step (c) may be specifically designed for use with a particular 
target sequence. In this case, it will be necessary to sequence at least the 5' and 3' ends of the target 
sequence so that the appropriate oligonucleotides can be synthesised. However, the present Inventors have 
discovered that it is not necessary to use such specifically designed primers. Instead, it Is possible to use a 

35 species specific general primer or a mixture of such primers for annealing to each end of the target 
sequence. This is not particularly surprising as regards the 3' end of the target sequence. It is known that 
this end of the variable domain encoding sequence leads into a segment encoding JH which Is known to be 
relatively conserved. However, it was surprisingly discovered that, within a single species, the sequence at 
the 5' end of the target sequence is sufficiently well conserved to enable a species specific general primer 

40 or a mixture thereof to be designed for the 5' end of the target sequence. 

Therefore according to a preferred aspect of the present invention, in step (c) the two primers which are 
used are species specific general primers, whether used as single primers or as mixtures of primers. This 
greatly facilitates the cloning of any undetermined target sequence since It will avoid the need to carry out 
any sequencing on the target sequence in order to produce target sequence-specific primers. Thus the 

45 method of this aspect of the invention provides a general method for cloning variable region or domain 
encoding sequences of a particular species. 

Once the variable domain gene has been cloned using the method described above, it may be directly 
inserted into an expression vector, for instance using the PGR reaction to paste the gene into a vector. 
Advantageously, however, each primer includes a sequence including a restriction enzyme recognition 

50 site. The sequence recognised by the restriction enzyme need not be in the part of the primer which 
anneals to the ds nucleic acid, but may be provided as an extension which does not anneal. The use of 
primers with restriction sites has the advantage that the DNA can be cut with at least one restriction enzyme 
which leaves 3' or 5' overhanging nucleotides. Such DNA is more readily cloned into the corresponding 
sites on the vectors than blunt end fragments taken directly from the method. The ds cDNA produced at the 

55 end of the cycles will thus be readily insertable Into a cloning vector by use of the appropriate restriction 
enzymes. Preferably the choice of restriction sites is such that the ds cDNA is cloned directly into an 
expression vector, such that the (igand encoded by the gene is expressed. In this case the restriction site Is 
preferably located in the sequence which Is annealed to the ds nucleic acid. 
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Since the primers may not have a sequence exactly complementary to the target sequence to which it 
is to be annealed, for instance because of nucleotide variations or because of the introduction of a 
restriction enzyme recognition site, it may be necessary to adjust the conditions in the annealing mixture to 
enable the primers to anneal to the ds nucleic acid. This is well within the competence of the person skilled 

5 In the art and needs no further explanation. 

In step (d), any DNA polymerase may be used. Such polymerases are known in the art and are 
available commercially. The conditions to be used with each polymerase are well known and require no 
further explanation here. The polymerase reaction will need to be carried out in the presence of the four 
nucleoside triphosphates. These and the polymerase enzyme may already be present In the sample or may 

70 be provided afresh for each cycle. 

The denaturing step (e) may be carried out. for instance, by heating the sample, by use of chaotropic 
agents, such as urea or guanidine, or by the use of changes in ionic strength or pH. Preferably, denaturing 
is carried out by heating since this is readily reversible. Where heating is used to carry out the denaturing, 
it will be usual to use a thermostable DNA polymerase, such as Taq polymerase, since this will not need 

75 replenishing at each cycle. 

If heating is used to control the method, a suitable cycle of heating comprises denaturation at about 
95 'C for about 1 minute, annealing at from 30*C to 65'C for about 1 minute and primer extension at about 
75 'C for about 2 minutes. To ensure that elongation and renaturation Is complete, the mixture after the final 
cycle is preferably held at about 60 * C for about 5 minutes. 

20 The product ds cDNA may be separated from the mixture for instance by gel electrophoresis using 
agarose gels. However, if desired, the ds cDNA may be used in unpurifled form and inserted directly into a 
suitable cloning or expression vector by conventional methods. This will be particularly easy to accomplish 
if the primers include restriction enzyme recognition sequences. 

The method of the present invention may be used to make variations in the sequences encoding the 

25 variable domains. For example this may be acheived by using a mixture of related oligonucleotide primers 
as at least one of the primers. Preferably the primers are particularly variable in the middle of the primer 
and relatively conserved at the 5' and 3' ends. Preferably the ends of the primers are complementary to the 
framework regions of the variable domain, and the variable region in the middle of the primer covers all or 
part of a CDR. Preferably a forward primer is used in the area which forms the third CDR. If the method is 

30 carried out using such a mixture of oligonucleotides, the product will be a mixture of variable domain 
encoding sequences. Moreover, variations in the sequence may be introduced by incorporating some 
mutagenic nucleotide triphosphates in step (d), such that point mutations are scattered throughout the target 
region. Alternatively such point mutations are introduced by performing a large number of cycles of 
amplification, as en-ors due to the natural error rate of the DNA polymerase are amplified, particularly when 

35 using high concentrations of nucleoside triphosphates. 

The method of this aspect of the present Invention has the advantage that it greatly facilitates the 
cloning of variable domain encoding sequences directly from mRNA or genomic DNA. This in turn will 
facilitate the production of modified Ig-type molecules by any of the prior art methodes referred to above. 
Further, target genes can be cloned from tissue samples containing antibody producing cells, and the 

40 genes can be sequenced. By doing this, it will be possible to look directly at the immune repertoire of a 
patient. This "fingerprinting" of a patient's immune repertoire could be of use in diagnosis, for instance of 
auto-Immune diseases. 

In step (a) the ds cDNA is derived from mRNA. For Ig derived variable domains, the mRNA is 
preferably be isolated from lymphocytes which have been stimulated to enhance production of mRNA. 

45 In each step (c) the set of primers are preferably different from the previous step (c), so as to enhance 
the specificity of copying. Thus the sets of primers form a nested set. For example, for cloning of Ig heavy 
chain variable domains, the first set of primers may be located within the signal sequence and constant 
region, as described by Larrick et al., [18], and the second set of primers entirely within the variable region, 
as described by Orlandi et al., [19]. Preferably the primers of step (c) include restriction sites to facilitate 

50 subsequent cloning. In the last cycle the set of primers used in step (c) should preferably include restriction 
sites for introduction into expression vectors. Possible mismatches between the primers and the template 
strands may be corrected by "nick translation", ds cDNA is preferably cleaved with restriction enzymes at 
sites introduced into the primers to facilitate the cloning. 

According to another aspect of the present invention the product ds cDNA is cloned directly into an 

55 expression vector. The host may be prokaryotic or eukaryotic, but is preferably bacterial. Preferably the 
choice of restriction sites in the primers and in the vector, and other features of the vector will allow the 
expression of complete ligands, while preserving all those features of the amino acid sequence which are 
typical of the (methoded) ligands. For example, for expression of the rearranged variable genes, the primers 
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would be chosen to allow the cloning of target sequences Including at least all the three CDR sequences. 
The cloning vector would then encode a signal sequence (for secretion of the ligand), and sequences 
encoding the N-terminal end of the first framework region, restriction sites for cloning and then the C- 
terminal end of the last (fourth) framework region. 

5 For expression of unrearranged VH genes as part of complete ligands, the primers would be chosen to 
allow the cloning of target sequences including at least the first two CDRs. The cloning vector could then 
encode signal sequence, the N-terminal end of the first framework region, restriction sites for cloning and 
then the C-terminal end of the third framework region, the third CDR and fourth framework region. 

Primers and cloning vectors may likewise be devised for expression of single CDRs, particularly the 

70 third CDR, as parts of complete ligands. The advantage of cloning repertoires of single CDRs would permit 
the design of a "universal" set of framework regions, incorporating desirable properties such as solubility. 

Single ligands could be expressed alone or in combination with a complementary variable domain. For 
example, a heavy chain variable domain can be expressed either as an individual domain or, If it is 
expressed with a complementary light chain variable domain, as an antigen binding site. Preferably the two 

15 partners would be expressed in the same cell, or secreted from the same cell, and the proteins allowed to 
associate non-covalently to form an Fv fragment. Thus the two genes encoding the complementary partners 
can be placed in tandem and expressed from a single vector, the vector including two sets of restriction 
sites. 

Preferably the genes are introduced sequentially: for example the heavy chain variable domain can be 
20 cloned first and then the light chain variable domain. Alternatively the two genes are introduced into the 
vector in a single step, for example by using the polymerase chain reaction to paste together each gene 
. with. any necessary intervening sequence, as essentially described by Yon and Fried [29]. The two partners 
could be also expressed as a linked protein to produce a single chain Fv fragment, using similar vectors to 
those described above. As a further alternative the two genes may be placed in two different vectors, for 
25 example in which one vector is a phage vector and the other is a plasmid vector. 

Moreover, the cloned ds cDNA may be Inserted into an expression vector already containing sequences 
encoding one or more constant domains to allow the vector to express Ig-type chains. The expression of 
Fab fragments, for example, would have the advantage over Fv fragments that the heavy and light chains 
would tend to associate through the constant domains in addition to the variable domains. The final 
30 expression product may be any of the modified Ig-type molecules referred to above. 

The cloned sequence may also be inserted into an expression vector so that it can be expressed as a 
fusion protein. The variable domain encoding sequence may be. linked directly or via a linker sequence to a 
DNA sequence encoding any protein effector molecule, such as a toxin, enzyme, label or another ligand. 
The variable domain sequences may also be linked to proteins on the outer side of bacteria or phage. Thus, 
35 the method of this aspect of the invention may be used to produce receptors according to the invention. 

According to another aspect of the invention, the cloning of ds cDNA directly for expression permits the 
rapid. construction of expression libraries which can be screened for binding activities. For Ig heavy and 
light chain variable genes, the ds cDNA may comprise variable genes isolated as complete rearranged 
genes from the animal, or variable genes built from several different sources, for example a repertoire of 
-40 unrearranged VH genes combined with a synthetic repertoire of DH and JH genes. Preferably repertoires of 
genes encoding Ig heavy chain variable domains are prepared from lymphocytes of animals immunised 
with an antigen. 

The screening method may take a range of formats well known in the art. For example Ig heavy chain 
variable domains secreted from bacteria may be screened by binding to antigen on a solid phase, and 

45 detecting the captured domains by antibodies. Thus the domains may be screened by growing the bacteria 
in liquid culture and binding to antigen coated on the surface of ELISA plates. However, preferably bacterial 
colonies (or phage plaques) which secrete ligands (or modified ligands, or ligand fusions with proteins) are 
screened for antigen binding on membranes. Either the ligands are bound directly to the membranes (and 
for example detected with labelled antigen), or captured on antigen coated membranes (and detected with 

so reagents specific for ligands). The use of membranes offers great convenience in screening many clones, 
and such techniques are well known in the art. 

The screening method may also be greatly facilitated by making protein fusions with the ligands, for 
example by introducing a peptide tag which is recognised by an antibody at the N-terminal or C-terminal 
end of the ligand, or joining the ligand to an enzyme which catalyses the conversion of a colourless 

55 substrate to a coloured product. In the latter case, the binding of antigen may be detected simply by adding 
substrate. Alternatively, for ligands expressed and folded correctly inside eukaryotic cells, joining of the 
ligand and a domain of a transcriptional activator such as the GAL4 protein of yeast, and joining of antigen 
to the other domain of the QAL4 protein, could form the basis for screening binding activities, as described 
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by Fields and Song [21]. 

The preparation of proteins, or even cells with multiple copies of the ligands. may improve the avidity of 
the ligand for immobilised antigen, and hence the sensitivity of the screening method. For example, the 
ligand may be joined to a protein subunit of a multimeric protein, to a phage coat protein or to an outer 

6 membrane protein of E. coli such as ompA or lamB. Such fusions to phage or bacterial proteins also offers 
possibilities of selecting bacteria displaying ligands with antigen binding activities. For example such 
bacteria may be precipitated with antigen bound to a solid support, or may be subjected to affinity 
chromatography, or may be bound to larger cells or particles which have been coated with antigen and 
sorted using a fluorescence activated cell sorter (FACS). The proteins or peptides fused to the ligands are 

10 preferably encoded by the vector, such that cloning of the ds cDNA repertoire creates the fusion product. 

In addition to screening for binding activities of single ligands. it may be necessary to screen for 
binding or catalytic activities of associated ligands, for example, the associated Ig heavy and light chain 
variable domains. For example, repertoires of heavy and light chain variable genes may be cloned such that 
two domains are expressed together. Only some of the pairs of domains may associate, and only some of 

75 these associated pairs may bind to antigen. The repertoires of heavy and light chain variable domains could 
be cloned such that each domain is paired at random. This approach may be most suitable for isolation of 
associated domains in which the presence of both partners is required to form a cleft. Alternatively, to allow 
the binding of hapten. Alternatively, since the repertoires of light chain sequences are less diverse than 
those of heavy chains, a small repertoire of light chain variable domains, for example including representa- 

20 five members of each family of domains, may be combined with a large repertoire of heavy chain variable 
domains. 

Preferably however, a repertoire of heavy chain variable domains is screened first for antigen binding in 
the absence of the light chain partner, and then only those heavy chain variable domains binding to antigen 
are combined with the repertoire of light chain variable domains. Binding of associated heavy and light 
25 chain variable domains may be distinguished readily from binding of single domains, for example by fusing 
each domain to a different C-terminal peptide tag which are specifically recognised by different monoclonal 
antibodies. 

The hierarchical approach of first cloning heavy chain variable domains with binding activities, then 
cloning matching light chain variable domains may be particulariy appropriate for the construction of 

30 catalytic antibodies, as the heavy chain may be screened first for substrate binding. A light chain variable 
domain would then be identified which is capable of association with the heavy chain, and "catalytic" 
residues such as cysteine or histidine (or prosthetic groups) would be introduced Into the CDRs to stabilise 
the transition state or attack the substrate, as described by Baldwin and Schultz [22]. 

Although the binding activities of non-covalently associated heavy and light chain variable domains (Fv 

35 fragments) may be screened, suitable fusion proteins may drive the association of the variable domain 
partners. Thus Fab fragments are more likely to be associated than the Fv fragments, as the heavy chain 
variable domain is attached to a single heavy chain constant domain, and the light chain variable domain is 
attached to a single light chain variable domain, and the two constant domains associate together. 

Alternatively the heavy and light chain variable domains are covalently linked together with a peptide, as 

40 in the single chain antibodies, or peptide sequences attached, preferably at the C-terminal end which will 
associate through forming cysteine bonds or through non-covalent interactions, such as the introduction of 
"leucine zipper" motifs. However, in order to isolate pairs of tightly associated variable domains, the Fv 
fragments are preferably used. 

The constnjction of Fv fragments Isolated from a repertoire of variable region genes offers a way of 

45 building complete antibodies, and an altemative to hybridoma technology. For example by attaching the 
variable domains to light or suitable heavy chain constant domains, as appropriate, and expressing the 
assembled genes in mammalian cells, complete antibodies may be made and should possess natural 
effector functions, such as complement lysis. This route is particulariy attractive for the construction of 
human monoclonal antibodies, as hybridoma technology has proved difficult, and for example, although 

50 human peripheral blood lymphocytes can be immortalised with Epstein Barr virus, such hybridomas tend to 
secrete low affinity IgM antibodies. 

Moreover, it is known that immmunological mechanisms ensure that lymphocytes do not generally 
secrete antibodies directed against host proteins. However it is desirable to make human antibodies 
directed against human proteins, for example to human cell surtace markers to treat cancers, or to 

55 histocompatibility antigens to treat auto-immune diseases. The construction of human antibodies built from 
the combinatorial repertoire of heavy and light chain variable domains may overcome this problem, as it will 
allow human antibodies to built with specificities which would nomnally have been eliminated. 
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The method also offers a new way of making bispecific antibodies. Antibodies with dual specificity can 
be made by fusing two hybndomas of different specificities, so as to make a hybrid antibody with an Fab 
arm of one specificity, and the other Fab arm of a second specificity. However the yields of the bispecific 
antibody are low, as heavy and light chains also find the wrong partners. The construction of Fv fragments 
5 which are tightly associated should preferentially drive the association of the correct pairs of heavy with 
light chains. (It would not assist in the correct pairing of the two heavy chains with each other.) The 
improved production of bispecific antibodies would have a variety of applications In diagnosis and therapy, 
as Is well known. 

Thus the invention provides a species specific general oligonucleotide primer or a mixture of such 

10 primers useful for cloning variable domain encoding sequences from animals of that species. The method 
allows a single pair or pair of mixtures of species specific general primers to be used to clone any desired 
antibody specificity from that species. This eliminates the need to carry out any sequencing of the target 
sequence to be cloned and the need to design specific primers for each specificity to be recovered. 

Furthermore it provides for the construction of repertoires of variable genes, for the expression of the 

75 variable genes directly on cloning, for the screening of the encoded domains for binding activities and for 
the assembly of the domains with other variable domains derived from the repertoire. 

Thus the use of the method of the present invention will allow for the production of heavy chain variable 
domains with binding activities and variants of these domains. It allows for the production of ncionoclonal 
antibodies and bispecific antibodies, and will provide an alternative to hybridoma technology. For instance, 

20 mouse splenic ds mRNA or genomic DNA may be obtained from a hyperimmunised mouse. This could be 
cloned using the method of the present invention and then the cloned ds DNA inserted into a suitable 
expression vector. The expression vector would be used to transform a host cell, for instance a bacterial 
cell, to enable it to produce an Fv fragment or a Fab fragment. The Fv or Fab fragment would then be built 
Into a monoclonal antibody by attaching constant domains and expressing It in mammalian cells. 

25 The present invention is now described, by way of example only, with reference to the accompanying 
drawings in which: 

Figure 1 shows a schematic representation of the unrearranged and rearranged heavy and light chain 
variable genes and the location of the primers; 

Figure 2 shows a schematic representation of the M13-VHPCR1 vector and a cloning scheme for 
30 amplified heavy chain variable domains; 

Figure 3 shows the sequence of the Ig variable region derived sequences in M13-VHPCR1; 

Figure 4 shows a schematic representation of the M13-VKPCR1 vector and a cloning scheme for light 

chain variable domains; 

Figure 5 shows the sequence of the Ig variable region derived sequences in M13-VKPCR1; 
35 Figure 6 shows the nucleotide sequences of the heavy and light chain variable domain encoding 
sequences of MAb MBrI; 

Rgure 7 shows a schematic representation of the pSV-gpt vector (also known as a-Lys 30) which 
contains a variable region cloned as a Hind Ill-Bam HI fragment, which is excised on Introducing the new 
variable region. The gene for human lgGl has also been engineered to remove a BamHI site, such that 

40 the BamHI site in the vector is unique; 

Figure 8 shows a schematic representation of the pSV-hygro vector (also known as a-Lys 17). It is 
derived from pSV gpt vector with the gene encoding mycophenolic acid replaced by a gene coding for 
hygromycin resistance. The construct contains a'variable gene cloned as a Hindlll-BamHI fragment 
which is excised on introducing the new variable region. The gene for human Ca has also been 

45 engineered to remove a BamHI site, such that the BamHI site in the vector is unique; 
Figure 9 shows the assembly of the mouse: human MBr1 chimaeric antibody; 
Figure 10 shows encoded amino acid sequences of 48 mouse rearranged VH genes; 
Figure 1 1 shows encoded amino acid sequences of human rearranged VH genes; 
Figure 12 shows encoded amino acid sequences of unrearranged human VH genes; 

50 Figure 13 shows the sequence of part of the plasmid pSWI: essentially the sequence of a pectate lyase 
leader linked to VHLYS in pSW1 and cloned as an Sphl-EcoRI fragment into pUCl9 and the translation 
of the open reading frame encoding the pectate lyase leader-VHLYS polypeptide being shown; 
Figure 14 shows the sequence of part of the plasmid pSW2: essentially the sequence of a pectate lyase 
leader linked to VHLYS and to VKLYS, and cloned as an Sphl-EcoRI-EcoRI fragment into pUCl9 and the 

55 translation of open reading frames encoding the pectate lyase leader-VHLYS and pectate lyase leader- 
VKLYS polypeptides being shown; 

Figure 15 shows the sequence of part of the plasmid pSWIHPOLYMYC which is based on pSWI and in 
which a polylinker sequence has replaced the variable domain of VHLYS, and acts as a cloning site for 
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amplified VH genes, and a peptide tag is introduced at the C-termihal end; 

Figure 16 shows the encoded annino.acid sequences of two VH domains derived from mouse spleen and 
having lysozyme binding activity, and compared with the VH domain of the D1,3 antibody. The arrows 
mark the points of difference between the two VH domains; 
5 Rgure 17 shows the encoded amino acid sequence of a VH domain derived from human peripheral 
blood lymphocytes and having lysozyme binding activity; 

Rgure 18 shows a scheme for generating and cloning mutants of the VHLYS gene, which is compared 
with the scheme for cloning natural repertoires of VH genes; 
Rgure 19 shows the sequence of part of the vector pSW2HP0LY; 
10 Rgure 20 shows the sequence of pari of the vector pSW3 which encodes the two linked VHLYS 
domains; 

Rgure 21 shows the sequence of the VHLYS domain and pelB leader sequence fused to the alkaline 

phosphatase gene; 

Rgure 22 shows the sequence of the vector pSWIVHLYSVKPOLYMYC for expression of a repertoire of 
75 Vx light chain variable domains in association with the VHLYS domain; and 

Rgure 23 shows the sequence of VH domain which is secreted at high levels from E. coli. The 
differences with VHLYS domain are marked. 

PRIMERS 

20 

In the Examples described below, the following oligonucleotide primers, or mixed primers were used. 
Their locations are marked on Rgure 1 and sequences are as follows: 


25 


30 


35 


40 


45 


50 


55 
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VHIFOR 5 
VHlFOR-2 5 

HulVHFOR 5 
HU2VHF0R 5 
HU3VHF0R 5 
HU4VHF0R 5 

MOJHIFOR 5 
M0JH2F0R 5 
M0JH3F0R 5 
M0JH4F0R 5 

HUJHIFOR 5 
HUJH2F0R 5 
HUJH4F0R 5 

VKIFOR 5 
VK2F0R 5 
VK3F0R 5 

MOJKIFOR 5 
M0JK3F0R 5 
M0JK4F0R 5 

HUJKIFOR 5 

HUJK3F0R 5 

HUJK4F0R 5 

HUJKSFOR 5.' 


TGAGGAGACGGTGACCGTGGTCCCTTGGCCCCAG 3 • ; 
TGAGGAGACGGTGACCGTGGTCCCTTGGCCCC 3 ' ; 

CTTGGTGGAGGCTGAGGAGACGGTGACC 3 • ; 
CTTGGTGGAGGCTGAGGAGACGGTGACC 3 • ; 
CTTGGTGGATGCTGAGGAGACGGTGACC 3 ' ; 
CTTGGTGGATGCTGATGAGACGGTGACC 3 ' ; 


TGAGGAGACGGTGACCGTGGTCCCTGCGCCCCAG 3 ' ; 
TGAGGAGACGGTGACCGTGGTGCCTTGGCCCCAG 3 • ; 
TGCAGAGACGGTGACCAGAGTCCCTTGGCCCCAG 3 • ; 
TGAGGAGACGGTGACCGAGGTTCCTTGACCCCA6 3 • ; 

TGAGGAGACGGTGACCAGGGTGCCCTGGCCCCAG 3 • ; 
TGAGGAGACGGTGACCAGGGTGCCACGGCCCCAG 3 ' ; 
TGAGGAGACGGTGACCAGGGTTCCTTGGCCCCAG 3 • ; 

GTTAGATCTCCAGCTTGGTCCC 3'' ; 
CGTTAGATCTCCAGCTTGGTCCC 3 • ; 
CCGTTTCAGCTCGAGCTTGGTCCC 3 ' ; 

CGTTAGATCTCCAGCTTGGTCCC 3 • ; 
GGTTAGATCTCCAGTCTGGTCCC 3 • ; 
CGTTAGATCTCCAACTTTGTCCC 3 ' ; 

CGTTAGATCTCCAGCTTGGTCCC 3«; 
CGTTAGATCTCCACTTTGGTCCC 3 • ; 
CGTTAGATCTCCACCTTGGTCCC 3'; 
CGTTAGATCTCCAGTCGTGTCCC 3 • ; 


VHIBACK 5' AGGT(C/G) (C/A)A(G/A)CTGCAG(G/C)AGTC(T/A)GG 3' 
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HU2VHIBACK: 5 

HuVHIIBACK: 5 

HU2VHIIIBACK: 5 

HuVHIVBACK: 5 

MOVHIBACK 5 

MOVHIIABACK 5 

MOVHIIBBACK 5 

MOVHIIBACK 5 

VKIBACK 5 

VK2BACK 5 

MOVKIIABACK 5 

MOVKIIBBACK 5 

HuHeplFOR 5 ' 

HuOctalBACK 5 

HuOcta2BACK 5 

HuOcta3BACK 5 


CAGGTGCAGCTGCAGCAGTCTGG 3 • ; 
CAGGTGCAGCTGCAGGAGTCGGG 3 ' ; 
GAGGTGCAGCTGCAGGAGTCTGG 3 ' ; 


AGGTGCAGCTGCAGGAGTCAG 3 • ; 
AGGTCCAGCTGCAGCA (G/A) TCTGG 3 • ; 
AGGTCCAACTGCAGCAGCCTGG 3 ' ; 
AGGTGAAGCTGCAGGAGTCTGG 3 • ; 


GACATTCAGCTGACCCAGTCTCCA 3 • ; 
GACATTGAGCTCACCCAGTCTCCA 3 • ; 

GATGTTCAGCTGACCCAAACTCCA 3 ' 
GATATTCAGCTGACCCAGGATGAA 3 • ; 

C ( A/ G ) ( 0/ G ) TGAGCTCACTGTGTCTCTCGCACA 3 • ; 
CGTGAATATGCAAATAA 3 ' ; 
AGTAGGAGACATGCAAAT 3 ' ; and 
CACCACCCACATGCAAAT 3»; 


VHMUTl 


M13 PRIMER 


) ' GGAGACGGTGACCGTGGTCCCTTGGCCCCAGTAGTCAAG 
NNNNNNNNNNNNCTCTCTGGC 3» (where N is an 
eguimolar mixture of T, C, G and A) 
5' AACAGCTATGACCATG 3' (New England Biolabs 
*1201) 


EXAMPLE 1 

Cloning of Mouse Rearranged Variable region genes from hybridomas, assembly of genes encoding 
chlmaeric antibodies and the expression of antibodies from myeloma cells 

VH1F0R is designed to anneal with the 3' end of the sense strand of any mouse heavy chain variable 
domain encoding sequence. It contains a BstEII recognition site. VK1F0R is designed to anneal with the 3' 
end of the sense strand of any mouse kappa-type light chain variable domain encoding sequence and 
contains a Bglll recognition site. VH1BACK is designed to anneal with the 3' end of the antisense strand of 
any mouse heavy chain variable domain and contains a PstI recognition site. VK1BACK is designed to 
anneal with the 3' end of the antisense strand of any mouse kappa-type light chain variable domain 
encoding sequence and contains a Pvul! recognition site. 

In this Example five mouse hybridomas were used as a source of ds nucleic acid. The hybridomas 
produce monoclonal antibodies (MAbs) designated MBr1 [23], BW431/26 [24], BW494/32 [25], BW250/183 
[24,26] and BW704/152 [27]. MAb MBrI is particularly interesting in that it is known to be specific for a 
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saccharide epitope on a human mammary carcinoma line MCF-7 [28]. 
Cloning via mRNA 

5 Each of the five hybridomas referred to above was grown up in roller bottles and about 5x10^ cells of 
each hybridoma were used to isolate RNA. mRNA was separated from the Isolated RNA using oligodT 
cellulose [29]. First strand cDNA was synthesised according to the procedure described by Maniatts et al. 
[30] as set out below. 

In order to clone the heavy chain variable domain encoding sequence, a 50 ul reaction solution which 
10 contains 10 ug mRNA, 20 pmole VH1F0R primer, 250 iliM each of dATP, dTTP, dCTP and dGTP, 10 mM 
dithiothreitol (DTT), 100 mM Tris.HCI. 10 mM MgCb and 140 mM KCI, adjusted to pH 8.3 was prepared. 
The reaction solution was heated at 70*C for ten minutes and allowed to cool to anneal the primer to the 3' 
end of the variable domain encoding sequence in the mRNA. To the reaction solution was then added 46 
units of reverse transcriptase (Anglian Biotec) and the solution was then incubated at 42'C for 1 hour to 
15 cause first strand cDNA synthesis. 

In order to clone the light chain variable domain encoding sequence, the same procedure as set out 
above was used except that the VK1F0R primer was used in place of the VH1F0R primer. 

Amplification from RNA/DNA hybrid 

20 

Once the ds RNA/DNA hybrids had been produced, the variable domain encoding sequences were 
amplified as follows. For heavy chain variable domain encoding sequence amplification, a 50 ul reaction 
solution containing 5 ul of the ds RNA/DNA hybrid-containing solution, 25 pmole each of VH1F0R and 
VH1BACK primers. 250 uM of dATP. dTTP. dCTP and dGTP. 67 mM Tris.HCI, 17 mM ammonium 

25 sulphate, 10 mM MgCb. 200 ug/ml gelatine and 2 units Taq polymerase (Cetus) was prepared. The 
reaction solution was overlaid with paraffin oil and subjected to 25 rounds of temperature cycling using a 
Techne PHC-1 programmable heating block. Each cycle consisted of 1 minute and 95*0 (to denature the 
nucleic acids). 1 minute at 30*C (to anneal the primers to the nucleic acids) and 2 minutes at 72*C (to 
cause elongation from the primers). After the 25 cycles, the reaction solution and the oil were extracted 

30 twice with ether, once with phenol and once with phenol/CHCI3. Thereafter ds cDNA was precipitated with 
ethanol. The precipitated ds cDNA was then taken up in 50 ul of water and frozen. 

The procedure for light chain amplification was exactly as described above, except that the VK1 FOR 
and VK1BACK primers were used In place of the VH1F0R and VH1BACK primers respectively, 

5 Ul of each sample of amplified cDNA was fractionated on 2% agarose gels by electrophoresis and 

35 stained with ethidium bromide. This showed that the amplified ds cDNA gave a major band of the expected 
size (about 330 bp). (However the band for VK DNA of MBrI was very weak. It was therefore excised from 
the gel and reamplified In a second round.) Thus by this simple procedure, reasonable quantities of ds DNA 
encoding the light and heavy chain variable domains of the five MAbs were produced. 

40 Heavy Chain Vector Construction 

A BstElI recognition site was introduced Into the vector M13-HuVHNP [31] by site directed mutagenesis 
[32.33] to produce the vector M13-VHPCR1 (Figures 2 and 3), 

Each amplified heavy chain variable domain encoding sequence was digested with the restriction 
45 enzymes PstI and BstEII. The fragments were phenol extracted, purified on 2% low melting point agarose 
gels and force cloned into vector M13-VHPCR1 which had been digested with PstI and BstEII and purified 
on an 0.8% agarose gel. Clones containing the variable domain inserts were identified directly by 
sequencing [34] using primers based In the 3' non-coding variable gene In the M13-VHPCR1 vector. 

There is an Internal PstI site in the heavy chain variable domain encoding sequences of BW431/26. This 
50 variable domain encoding sequence was therefore assembled in two steps. The 3' Pstl-BstEII fragment was 
first cloned Into M13-VHPCR1 . followed in a second step by the 5' PstI fragment. 

Light Chain Vector Construction 

55 Vector M13mp18 [35] was cut with Pvull and the vector backbone was blunt ligated to a synthetic 
Hindlll-BamHI polyllnker. Vector M13-HuVKLYS [36] was digested with Hindlll and BamHI to Isolate the 
HuVKLYS gene. This Hindlll-BamHI fragment was then inserted into the Hindlll-BamHl polylinker site to 
form a vector M13-VKPCR1 which lacks any Pvull sites in the vector backbone (Rgures 4 and 5). This 
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vector was prepared in E Coli JM110 [22] to avoid dam methylation at the Bell site. 

Each amplified light chain variable domain encoding sequence was digested with Pvull and Bglll. The 
fragments were phenol extracted, purified on 2% low melting point agarose gels and force cloned Into 
vector M13-vkPCR1 which had been digested with Pvull and Boll, purified on an 0.8% agarose gel and 
5 treated with calf Intestinal phosphatase. Clones containing the light chain variable region inserts were 
identified directly by sequencing [34] using primers based in the 3' non-coding region of the variable 
domain In the M13-VKPCR1 vector. 

The nucleotide sequences of the MBr1 heavy and light chain variable domains are shown in Rgure 6 
with part of the flanking regions of the M13-VHPCR1 and M13-VKPCR1 vectors. 

10 

Antibody Expression 

The HIndlll-BamHI fragment carrying the MBr1 heavy chain variable domain encoding sequence in 
M13-VHPCR1 was recloned into a pSV-gpt vector with human 7I constant regions [37] (Figure 7). The 

75 MBrI light chain variable domain encoding sequence in M13-VKPCR1 was recloned as a Hindlll-BamHI 
fragment Into a pSV vector, PSV-hyg-HuCK with a hygromycln resistance marker and a human kappa 
constant domain (Figure 8). The assembly of the genes is summarised in Figure 9. 

The vectors thus produced were linearised with Pvul On the case of the pSV-hygro vectors the Pvul 
digest is only partial) and cotransfected into the non-secreting mouse myeloma line NSO [38] by 

20 electroporation [39]. One day after cotransfection. cells were selected in 0.3 ug/ml mycophenolic acid 
(MPA) and after seven days in lug/ml MPA. After 14 days, four wells, each containing one or two major 
colonies, were screened by Incorporation of ^*C-lysine [40] and the secreted antibody detected after 
precipitation with protein-A Sepharose^" (Pharmacia) on SDS-PAGE [41]. The gels were stained, fixed, 
soaked In a fluorographic reagent, Amplify^ (Amersham), dried and autoradiographed on pref lashed film at 

25 -70 • C for 2 days. 

Supernatant was also tested for binding to the mammary carcinoma line MCF-7 and the colon 
carcinoma line HT-29. essentially as described by Menard et al. [23], either by an Indirect Im- 
munoflorescence assay on cell suspensions (using a fluorescein-tabelled goat anti-human IgG (Amersham)) 
or by a solid phase RIA on monolayers of fixed cells (using ^25|-protein A (Amersham)). 
30. It was found that one of the supernatants from the four wells contained secreted antibody. The chimeric 
antibody in the supernatant, like the parent mouse MBr1 antibody, was found to bind to MCF-7 cells but not 
the HT-29 cells, thus showing that the specificity had been properly cloned and expressed. 

Example 2 

35 

Cloning of reanranged variable genes from genomic DNA of mouse spleen 
Preparation of DNA from spleen. 

40 The DNA from the mouse spleen was prepared in one of two ways (although other ways can be used). 

Method 1. A mouse spleen was cut into two pieces and each piece was put into a standard Eppendorf 
tube with 200 ul of PBS. The tip of a 1 ml glass pipette was closed and rounded in the blue flame of a 
Bunsen burner. The pipette was used to squash the spleen piece In each tube. The cells thus produced 
were transferred to a fresh Eppendorf tube and the method was repeated three times until the connective 

45 tissue of the spleen appeared white. Any connective tissue which has been transferred with the cells was 
removed using a drawn-out Pasteur pipette. The cells were then washed in PBS and distributed into four 
tubes. 

The mouse spleen cells were then sedlmented by a 2 minute spin In a Microcentaur centrifuge at low 
speed setting. All the supernatant was aspirated with a drawn out Pasteur pipette. If desired, at this point 

50 the celt sample can be frozen and stored at -20 "C 

. To. the cell sample (once thawed if it had been frozen) was added 500 ul of water and 5 ul of a 10% 
solution of NP-40, a non-ionic detergent. The tube was closed and a hole was punched in the lid. The tube 
was placed on a boiling water bath for 5 minutes to disrupt the cells and was then cooled on ice for 5 
minutes. The tube was then spun for 2 minutes at high speed to remove cell debris. 

55 The supernatant was transferred to a new tube and to this was added 125 ul 5M NaCl and 30 ul 1M 
MOPS adjusted to pH 7.0. The DNA in the supematant was absorised on a Quiagen 5 tip and purified 
following the manufacturer's instructions for lambda DNA. After isopropanol precipitation, the DNA was 
resuspended in 500 ul water. 
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Method 2. This method is based on the technique described in Maniatis et al. [30]. A mouse spleen was 
cut into very fine pieces and put into a 2 ml glass homogeniser. The cells were then freed from the tissue 
by several slow up and down strokes with the piston. The cell suspension was made In 500 ul phosphate 
buffered saline (PBS) and transferred to an Eppendorf tube. The cells were then spun for 2 min at low 
5 speed in a Microcentaur centrifuge. This results in a visible separation of white and red cells. The white 
cells, sedimenting slower, form a layer on top of the red cells. The supernatant was carefully removed and 
spun to ensure that all the white cells had sedlmented. The layer of white cells was resuspended in two 
portions of 500 ul PBS and transferred to another tube. 

The white cells were precipitated by spinning in the Microcentaur centrifuge at low speed for one 
70 minute. The cells were washed a further two times with 500 ul PBS. and were finally resuspended in 200 ii\ 
PBS. The white cells were added to 2.5 ml 25 mM EDTA and 10 mM Tris.CI, pH 7.4, and vortexed slowly. 
While vortexing 25 ii\ 20% SDS was added. The cells lysed immediately and the solution became viscous 
and clear. 100 rI of 20 mg/ml proteinase K was added and incubated one to three hours at 50* C. 

The sample was extracted with an equal volume of phenol and the same volume of chloroform, and 
75 vortexed. After centrifuging, the aqueous phase was removed and 1/10 volume 3M ammonium acetate was 
added. This was overlaid with three volumes of cold ethanol and the tube rocked carefully until the DNA 
strands became visible. The DNA was spooled out with a Pasteur pipette, the ethanol allowed to drip off. 
and the DNA transferred to 1 ml of 10 mM Tris.CI pH 7.4. 0.1 mM EDTA in an Eppendorf tube. The DNA 
was allowed to dissolve in the cold overnight on a roller. 

20 

Amplification from genomic DNA. 

The DNA solution was diluted 1/10 in water and boiled for 5 min prior to using the polymerase chain 
reaction (PGR). For each PGR reaction, typically 50-200 ng of DNA were used. 

25 The heavy and light chain variable domain encoding sequences in the genomic DNA isolated from the 
human PBL or the mouse spleen cells was then amplified and cloned using the general protocol described 
in the first two paragraphs of the section headed "Amplification from RN/VDNA Hybrid" In Example 1. 
except that during the annealing part of each cycle, the temperature was held at 65 • C and that 30 cycles 
were used. Furthermore, to minimise the annealing between the 3' ends of the two primers, the sample was 

30 first heated to 95 *C. then annealed at 65 *G. and only then was the Taq polymerase added. At the end of 
the 30 cycles, the reaction mixture was held at 60"C for five minutes to ensure that complete elongation 
and renaturation of the amplified fragments had taken place. 

The primers used to amplify the mouse spleen genomic DNA were VHIFOR and VH1BACK, for the 
heavy chain variable domain and VK2F0R and VK1BACK, for the light chain variable domain. (VK2F0R 

35 only differs from VK1F0R in that it has an extra C residue on the 5' end.) 

Other sets of primers, designed to optimise annealing with different families of mouse VH and Va genes 
were devised and used in mixtures with the primers above. For example, mixtures of VK1F0R, M0JK1F0R. 
M0JK3F0R and M0JK4F0R were used as fonward primers and mixtures of VK1BAGK, MOVKIIABACK and 
MOVKIIBBACK as back primers for amplification of Vx genes. Likewise mixtures of VH1F0R, M0JH1F0R, 

40 M0JH2F0R, M0JH3F0R and M0JH4F0R were used as forward primers and mixtures of VH1BACK, 
MOVHIBAGK, MOVHIIABAGK. MOVHIIBBAGK. MOVHIIIBAGK were used as backward primers for amplifica- 
tion of VH genes. 

All these heavy chain FOR primers referred to above contain a BstEll site and all the BACK primers 
referred to above contain a PstI site. These light chain FOR and BACK primers referred to above all contain 

45 Bglll and Pvull sites respectively. Light chain primers (VK3F0R and VK2BACK) were also devised which 
utilised different restriction sites. Sad and Xhol. 

Typically all these primers yielded amplified DNA of the correct size on gel electrophoresis, although 
other bands were also present. However, a problem was Identified in which the 5' and 3' ends of the forward 
and backward primers for the VH genes were partially complementary, and this could yield a major band of 

50 "primer-dimer" in which the two oligonucleotides prime on each other. For this reason an improved forward 
primer, VH1F0R-2 was devised in which the two 3' nucleotides were removed from VHIFOR. 

Thus, the preferred amplification conditions for mouse VH genes are as follows: the sample was made 
in a volume of 50-100 al. 50-100 ng of DNA, VH1F0R-2 and VHIBACK primers (25 pmole of each), 250 
uM of each deoxynucleotide triphosphate, 10 mM Tris.HGI, pH 8.8, 50 mM KCI, 1.5 mM MgCb. and 100 

55 ag/ml gelatine. The sample was overlaid with paraffin oil. heated to 95* C for 2 min, 65* C for 2 min. and 
then to 72*0: taq polymerase was added after the sample had reached the elongation temperature and the 
reaction continued for 2 min at 72* C. The sample was subjected to a further 29 rounds of temperature 
cycling using the Techne PHG-1 programmable heating block. 
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The preferred amplification conditions for mouse Vk genes from genomic DNA are as follows: the 
sample treated as above except with Vx primers, for example VK3F0R and VK2BACK. and using a cycle of 
94 • C for one minute, 60 • C for one minute and 72 • C for one minute. 

The conditions which were devised for genomic DNA are also suitable for amplification from the cDNA 
5 derived from mRNA from mouse spleen or mouse hybridoma. 

Cloning and analysis of variable region genes 

The reaction mixture was then extracted twice with 40 ul of water-saturated diethyl ether. This was 
10 followed by a standard phenol extraction and ethanol precipitation as described In Example 1. The DNA 
pellet was then dissolved in 100 ul 10 mM Trts.CI, 0.1 mM EDTA. 

Each reaction mixture containing a light chain variable domain encoding sequence was digested with 
Sad and Xhol (or with Pvull and Bglll) to enable it to be llgated into a suitable expression vector. Each 
reaction mixture containing a heavy chain variable domain encoding sequence was digested with PstI and 
75 BstEII for the same purpose. 

The heavy chain variable genes isolated as above from a mouse hyperlmmunised with lysozyme were 
cloned into M13VHPCR1 vector and sequenced. The complete sequences of 48 VH gene clones were 
determined (Rgure 10). All but two of the mouse VH gene families were represented, with frequencies of: 
VA (1). IIIC (1). IIIB (8). IIIA (3). IIB (17). IIA (2). IB (12), lA (4). In 30 clones, the D segments could be 
20 assigned to families SP2 (14), FL16 (11) and Q52 (5), and in 38 clones the JH minigenes to families JH1 
(3), JH2 (7), JH3 (14) and JH4 (14). The different sequences of CDR3 marked out each of the 48 clones as 
unique. Nine pseudogenes and 16 unproductive rearrangements were identified. Of the clones sequenced, 
27 have open reading frames. 

Thus the method is capable of generating a diverse repertoire of heavy chain variable genes from 
25 mouse spleen DNA. 

Example 3 

Cloning of rearranged variable genes from mRNA from human peripheral blood lymphocytes 

30 

Preparation of mRNA. 

Human peripheral blood lymphocytes were purified and mRNA prepared directly (Method 1). or mRNA 
was prepared after addition of Epstein Barr virus (Method 2). 

35 Method 1. 20 ml of heparinised human blood from a healthy volunteer was diluted with an equal volume 
of phosphate buffered saline (PBS) and distributed equally into 50 ml Falcon tubes. The blood was then 
underiayed with 15ml Ficoll Hypaque (Pharmacia IO-A-001-07). To separate the lymphocytes from the red 
blood cells, the tubes were spun for 10 minutes at 1800 rpm at room temperature in an lEC Centra 3E table 
centrifuge. The peripheral blood lymphocytes (PBL) were then collected from the interphase by aspiration 

40 with a Pasteur pipette. The cells were diluted with an equal volume of PBS and spun again at 1500 rpm for 
15 minutes. The supernatant was aspirated, the cell pellet was resuspended in 1 ml PBS and the cells were 
distributed Into two Eppendorf tubes. 

Method 2. 40 ml human blood from a patient with HIV in the pre-AIDS condition was layered on Ficoll to 
separate the white cells (see Method 1 above). The white cells were then incubated in tissue culture 

45 medium for 4-5 days. On day 3, they were infected with Epstein Barr virus. The cells were pelleted (approx 
• 2x10^ cells) and washed in PBS. 

The cells were pelleted again and lysed with 7 ml 5M guanidine isothiocyanate, 50 mM Tris, 10 mM 
EDTA, 0.1 mM dithiothreitol. The cells were vortexed vigorously and 7 volumes of 4M LiCI added. The 
mixture was incubated at 4'C for 15-20 hrs. The suspension was spun and the supernatant resuspended in 

50 3M UCl and centrifuged again. The pellet was dissolved in 2ml 0.1 % SDS, 10 mM Tris HCI and 1 mM 
EDTA. The suspension was frozen at -20 'C, and thawed by vortexing for 20 s every 10 min for 45 min. A 
large white pellet was left behind and the clear supernatant was extracted with phenol chloroform, then with 
chloroform. The RNA was precipitated by adding 1/10 volume 3M sodium acetate and 2 vol ethanol and 
leaving overnight at -20 *C. The pellet was suspended in 0.2 ml water and reprecipitated with ethanol. 

55 Aliquots for cDNA synthesis were taken from the ethanol precipitate which had been vortexed to create a 
fine suspension. 

100 ul of the suspension was precipitated and dissolved In 20 ul water for cDNA synthesis [30] using 
10 pmole of a HUFOR primer (see below) in final volume of 50 ul. A sample of 5 ul of the cDNA was 
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amplified as in Example 2 except using the primers for the human VH gene families (see below) using a 

cycle of 95 • C, 60 • C and 72 • C. 

The back primers for the amplification of human DNA were designed to match the available human 

heavy and light chain sequences, in which the different families have slightly different nucleotide sequences 
5 at the 5' end. Thus for the human VH genes, the primers Hu2VHIBACK, HuVHIIBACK, Hu2VHIIIBACK and 

HuVHIVBACK were designed as back primers, and HUJH1F0R. HUJH2F0R and HUJH4F0R as forward 

primers based entirely in the variable gene. Another set of forward primers HulVHFOR, Hu2VHF0R, 

HuGVHFOR, and Hu4VHF0R was also used, which were designed to match the human J-reglons and the 5' 

end of the constant regions of different human isotopes. 
10 Using sets of these primers It was possible to demonstrate a band of amplified ds cDNA by gel 

electrophoresis. 

One such experiment was analysed in detail to establish whether there was a diverse repertoire in a 
patient with HIV infection. It is known that during the course of AIDS, that T-cells and also antibodies are 
greatly diminished In the blood. Presumably the repertoire of lymphocytes Is also diminished. In this 

15 experiment, for the forward priming, an equimolar mixture of primers HulVHFOR, Hu2VHF0R, HuSVHFOR, 
and Hu4VHF0R (in PGR 25 pmole of primer 5' ends) was used. For the back priming, the primers 
Hu2VHIBACK, HuVHIIBACK, Hu2VHIIIBACK and HuVHIVBACK were used separately in four separate 
primings. The amplified DNA from the separate primings was then pooled, digested with restriction 
enzymes PstI and BstEII as above, and then cloned into the vector M13VHPCR1 for sequencing. The 

20 sequences reveal a diverse repertoire (Fig. 1 1) at this stage of the disease. 

For human Vx genes the primers HuJKIFOR. HUJK3F0R. HUJK4F0R and HUJK5F0R were used as 
fon/vard primers and VK1BACK as back primer. Using these primers It was possible to see a band of 
amplified ds cDNA of the correct size by gel electrophoresis. 

25 Example 4 

Cloning of unrearranged variable gene genomic DNA from human peripheral blood lymphocytes 

Human peripheral blood lymphocytes of a patient with non-Hodgklns lymphoma were prepared as In 
30 Example 3 (Method 1). The genomic DNA was prepared from the PBL using the technique described in 
Example 2 (Method 2). The VH region In the isolated genomic DNA was then amplified and cloned using 
the general protocol described in the first two paragraphs of the section headed "Amplification from 
RNA/DNA hybrid" in Example 1 above, except that during the annealing part of each cycle, the temperature 
was held at 55 • C and that 30 cycles were used. At the end of the 30 cycles, the reaction mixture was held 
35 at 60'C for five minutes to ensure that complete elongation and renaturation of the amplified fragments had 
taken place. 

The forward primer used was HuHeplFOR. which contains a Sad site. This primer is designed to 
anneal to the 3' end of the unrearranged human VH region gene, and in particular includes a sequence 
complementary to the last three codons in the VH region gene and nine nucleotides downstream of these 
40 three codons. 

As the back primer, an equimolar mixture of HuOctalBACK. HuOcta2BACK and HuOcta3BACK was 
used. These primers anneal to a sequence in the promoter region of the genomic DNA VH gene (see 
Figure 1). 5ul of the amplified DNA was checked on 2% agarose gels In TBE buffer and stained with 
ethidium bromide. A double band was seen of about 620 nucleotides which corresponds to the size 
45 expected for the unrearranged VH gene. The ds cDNA was digested with Sad and cloned into an M13 
' vector for sequencing. Although there are some sequences which are identical, a range of different 
unrearranged human VH genes were identified (Rgure 12). 

Example 5 

50 

Cloning Variable Domains with Binding Activities from a Hybridoma 

The heavy chain variable domain (VHLYS) of the D1.3 (anti-lysozyme) antibody was cloned into a 
vector similar to that described previously [42] but under the control of the lac z promoter, such that the 
55 VHLYS domain Is attached to a pelB leader sequence for export into the periplasm. The vector was 
constructed by synthesis of the pelB leader sequence [43], using overlapping oligonucleotides, and cloning 
into a pUC 19 vector [35]. The VHLYS domain of the D1.3 antibody was derived from a cDNA clone [44] 
and the construct (pSWI) sequenced (Rgure 13). 
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To express both heavy and light chain variable donnains together, the light chain variable region 
(VKLYS) of the D1.3 antibody was Introduced Into the pSW1 vector, with a pelB signal sequence to give the 
construct pSW2 (Figure 14). 

A strain of E. coli (BMH71-18) [45] was then transformed [46,47] with the plasmid pSWI or pSW2, and 
5 colonies resistant to ampicillin (100 ug/ml) were selected on a rich (2 x TY = per litre of water, 16g Bacto- 
tryptone, lOg yeast extract, 5g NaCI) plate which contained 1% glucose to repress the expression of 
variable domain(s) by catabolite repression. 

The colonies were inoculated into 50 nr^l 2 x TY (with 1% glucose and 100 ug/m! ampicillin) and grown 
in flasks at 37'C with shaking for 12-16 hr. The cells were centrifuged. the pellet washed twice with 50 mM 
10 sodium chloride, resuspended in 2 x TY medium containing 100 ixg/ml ampicillin and the inducer IPTG (1 
mM) and grown for a further 30 hrs at 37 ^C. The cells were centrifuged and the supernatant was. passed 
through a Nalgene filter (0.45 urn) and then down a 1 - 5 ml lysozyme-Sepharose affinity column. (The 
column was derived by coupling lysozyme at 10 mg/ml to CNBr activated Sepharose.) The column was first 
washed with phosphate buffered saline (PBS), then with 50 mM diethylamine to elute the VHLYS domain 
75 (from pSWI ) or VHLYS in association with VKLYS (from pSW2). 

The VHLYS and VKLYS domains were identified by SDS polyacrylamide electrophoresis as the correct 
size. In addition, N-terminal sequence determination of VHLYS and VKLYS isolated from a polyacrylamide 
gel showed that the signal peptide had been produced correctly. Thus both the Fv fragment and the VHLYS 
domains are able to bind to the lysozyme affinity column, suggesting that both retain at least some of the 
20 affinity of the original antibody. 

The size of the VHLYS domain was compared by FPLC with that of the Fv fragment on Superose 12. 
This indicates that the VHLYS domain is a monomer. The binding of the VHLYS and Fv fragment to 
lysozyme was checked by ELISA, and equilibrium and rapid reaction studies were carried out using 
fluorescence quench. 
25 The ELISA for lysozyme binding was undertaken as follows: 

(1) The plates (Dynatech Immulon) were coated with 200 \l\ per well of 300 ug/ml lysozyme in 50 mM 
NaHCOa. pH 9.6 overnight ar room temperature; 

(2) The wells were rinsed with three washes of PBS, and blocked with 300 a.1 per well of 1% Sainsbury*s 
instant dried skimmed milk powder In PBS for 2 hours at 37 •C: 

30 (3) The wells were rinsed with three washes of PBS and 200 ul of VHLYS or Fv fragment (VHLYS 
associated with VKLYS) were added and incubated for 2 hours at room temperature; 

(4) The wells were washed three times with 0.05% Tween 20 in PBS and then three times with PBS to 
remove detergent; 

(5) 200 ul of a suitable dilution (1:1000) of rabbit polyclonal antisera raised against the FV fragment in 
35 2% skimmed milk powder in PBS was added to each well and Incubated at room temperature for 2 

hours; 

(6) Washes were repeated as in (4); 

(7) 200 ul of a suitable dilution (1:1000) of goat anti-rabbit antibody (ICN Immunochemicals) coupled to 
horse radish peroxidase, in 2% skimmed milk powder in PBS, was added to each well and incubated at 

40 room temperature for 1 hour; 

(8) Washes were repeated as in (4); and 

(9) 200 ul 2,2'azino-bis(3-ethylbenzthiazolinesulphonic acid) [Sigma] (0.55 mg/ml, with 1 ul 20% 
hydrogen peroxide: water per 10 ml) was added to each well and the colour allowed to develop for up to 
10 minutes at room temperature. 

45 The reaction was stopped by adding 0.05% sodium azide in 50 mM citric acid pH 4.3. ELISA plates 
were read in a TItertek Multiscan plate reader. Supernatant from the Induced bacterial cultures of both 
pSWI (VHLYS domain) or pSW2 (Fv fragment) was found to bind to lysozyme in the ELISA. 

The purified VHLYS and Fv. fragments were titrated with lysozyme using fluorescence quench (Perkin 
Elmer LS5B Luminescence Spectrometer) to measure the stolchiometry of binding and the affinity constant 

50 for lysozyme [48,49]. The titration of the Fv fragment at a concentration of 30 nM indicates a dissociation 
constant of 2.8 nM using a Scatchard analysis. 

A similar analysis using fluorescence quench and a Scatchard plot was carried out for VHLYS, at a 
VHLYS concentration of 100 nM. The stolchiometry of antigen binding is about 1 mole of lysozyme per 
mole of VHLYS (calculated from plot). (The concentration of VH domains was calculated from optical 

55 density at 280 nM using the typical extinction coefficient for complete immunoglobulins.) Due to possible 
errors in measuring low optical densities and the assumption about the extinction coefficient, the 
stolchiometry was also measured more carefully. VHLYS was titrated with lysozyme as above using 
fluorescence quench. To determine the concentration of VHLYS a sample of the stock solution was 
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removed, a known amount of norleucine added, and the sample subjected to quantitative amino acid 
analysis. This showed a stolchiometry of 1.2 mole of lysozyme per mole of VHLYS domain. The 
dissociation constant was calculated at about 12 nM. 

The on-rates for VHLYS and Fv fragments with lysozyme were determined by stopped-flow analysis (HI 

5 Tech Stop Flow SHU machine) under pseudo-first order conditions with the fragment at a ten fold higher 
concentration than lysozyme [50]. The concentration of lysozyme binding sites was first measured by 
titration with lysozyme using fluorescence quench as above. The on rates were calculated per mole of 
binding site (rather than amount of VHLYS protein). The on-rate for the Fv fragment was found to be 2.2 x 
10^ M-^s"^ at 25 -C. The on-rate for the VHLYS fragment found to be 3.8 x 10^ M"^ s"^ and the off-rate 

10 0.075 s"i at 20*C. The calculated affinity constant is 19 nM. Thus the VHLYS binds to lysozyme with a 
dissociation constant of about 19 nM, compared with that of the Fv of 3 nM. 

Example 6 

75 Cloning complete variable domains with binding activities from mRNA or DNA of antibody-secreting cells 

A mouse was immunised with hen egg white lysozyme (100 ixg i.p. day 1 in complete Freunds 
adjuvant), after 14 days immunised i.p. again with 100 ug lysozyme with incomplete Freunds adjuvant, and 
on day 35 i.v. with 50 ug lysozyme in saline. On day 39, spleen was harvested. A second mouse was 
20 immunised with keyhole limpet haemocyanin (KLH) in a similar way. The DNA was prepared from the 
spleen according to Example 2 (Method 2). The VH genes were amplified according to the prefenred 
method in Example 2. 

Human peripheral blood lymphocytes from a patient infected with HIV were prepared as in Example 3 
(Method 2) and mRNA prepared. The VH genes were amplified according to the method described in 
25 Example 3, using primers designed for human VH gene families. 

After the PGR, the reaction mixture and oil were extracted twice with ether, once with phenol and once 
with phenol/CHCb. The double stranded DNA was then taken up in 50 ul of water and frozen. 5 ii\ was 
digested with PstI and BstEII (encoded within the amplification primers) and loaded on an agarose gel for 
electrophoresis. The band of amplified DNA at about 350 bp was extracted. 

30 

Expression of anti-lysozyme activities 

The repertoire of amplified heavy chain variable domains (from mouse Immunised with lysozyme and 
from human PBLs) was then cloned directly into the expression vector pSWIHPOLYMYC. This vector is 

35 derived from pSWI except that the VHLYS gene has been removed and replaced by a polylinker restriction 
site. A sequence encoding a peptide tag was inserted (Figure 15). Colonies were toothpicked into 1 ml 
cultures. After induction (see Example 5 for details), 10 ul of the supernatant from fourteen 1 ml cultures 
was loaded on SDS-PAGE gels and the proteins transfen-ed electrophoretically to nitrocellulose. The blot 
was probed with antibody 9E10 directed against the peptide tag. 

40 The probing was undertaken as follows. The nitrocellulose filter was incubated in 3% bovine serum 
albumin (BSA)/TBS buffer for 20 min (10 x TBS buffer is 100 mM Tris.HCI. pH 7.4, 9% w/v NaCI). The filter 
was incubated in a suitable dilution of antibody 9E10 (about 1/500) in 3% BSA/TBS for 1 - 4 hrs. After three 
washes in TBS (100 ml per wash, each wash for 10 min). the filter was incubated with 1:500 dilution of anti- 
mouse antibody (peroxidase conjugated anti-mouse Ig (Dakopats)) in 3% BSA/TBS for 1 - 2 hrs. After three 

45 washes in TBS and 0.1% Triton X-100 (about 100 ml per wash, each wash for 10 min). a solution containing 
10 ml chloronapthol in methanol (3 mg/ml), 40 ml TBS and 50 ul hydrogen peroxide solution was added 
over the blot and allowed to react for up to 10 min. The substrate was washed out with excess water. The 
blot revealed bands similar in mobility to VHLYSMYC on the Western blot, showing that other VH domains 
could be expressed. 

50 Colonies were then toothpicked individually into wells of an ELISA plate (200 ul) for growth and 
induction. They were assayed for lysozyme binding with the 9E10 antibody (as in Examples 5 and 7). Wells 
with lysozyme-binding activity were identified. Two positive wells (of 200) were identified from the amplified 
mouse spleen DNA and one well from the human cDNA, The heavy chain variable domains were purified on 
a column of lysozyme-Sepharose, The affinity for lysozyme of the clones was estimated by fluorescence 

55 quench titration as >50nM. The affinities of the two clones (VH3 and VH8) derived from the mouse genes 
were also estimated by stop flow analysis (ratio of kon/koff) as 12 nM and 27 nM respectively. Thus both 
these clones have a comparable affinity to the VHLYS domain. The encoded amino acid sequences of of 
VH3 and VH8 are given in Figure 16, and that of the human variable domain in Figure 17. 
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A library of VH domains made from the mouse immunised with lysozyme was screened for both 
lysozyme and keyhole limpet haemocyanin (KLH) binding activities. Two thousand colonies were toothpic- 
ked in groups of five into wells of ELISA plates, and the supernatants tested for binding to lysozyme coated 
plates and separately to KLH coated plates. Twenty one supernatants were shown to have lysozyme 
5 binding activities and two to have KLH binding activities. A second expression library, prepared from a 
mouse immunised with KLH was screened as above. Fourteen supernatants had KLH binding activities and 
a single supematant had lysozyme binding activity. 

This shows that antigen binding activities can be prepared from single VH domains, and that 
immunisation facilitates the isolation of these domains. 

Example 7 

Cloning variable domains with binding activities by mutagenesis. 

75 Taking a single rearranged VH gene, it may be possible to derive entirely new antigen binding activities 
by extensively mutating each of the CDRs. The mutagenesis might be entirely random, or be derived from 
pre-existing repertoires of CDRs. Thus a repertoire of CDR3s might be prepared as in the preceding 
examples by using "universal" primers based in the flanking sequences, and likewise repertoires of the 
other CDRs (singly or in combination). The CDR repertoires could be stitched into place in the flanking 

20 framework regions by a variety of recombinant DNA techniques. 

CDR3 appears to be the most promising region for mutagenesis as CDRS is more variable in size and 
sequence than CDRs 1 and 2. This region would be expected to make a major contribution to antigen 
binding. The heavy chain variable region (VRLYS) of tiie anti-lysozyme antibody D1.3 is known to make 
several important contacts in the CDR3 region. 

25 Multiple mutations were made in CDR3. The polymerase chain reaction (PCR) and a highly degenerate 
primer were used to make the mutations and by this means the original sequence of CDR3 was destroyed. 
(It would also have been possible to construct the mutations in CDR3 by cloning a mixed oligonucleotide 
duplex into restriction sites flanking the CDR or by other methods. of site-directed mutagenesis). Mutants 
expressing heavy chain variable domains with affinities for lysozyme were screened and those with 

30 improved affinities or new specificities were identified. 

The source of the heavy chain variable domain was an Ml 3 vector containing the VHLYS gene. The 
body of the sequence encoding the variable region was amplified using the polymerase chain reaction 
(PCR) with tiie mutagenic primer VHMUT1 based in CDR3 and the Ml 3 primer which is based in tiie M13 
vector backbone. The mutagenic primer hypermutates the central four residues of CDR3 (Arg-Asp-Tyr-Arg). 

35 The PCR was carried out for 25 cycles on a Techne PHC-1 programmable heat block using 100 ng single 
stranded M13mp19SW0 template, witii 25 pmol of VHMUT1 and the M13 primer. 0.5 mM each dNTP. 
67mM Tris.HCI, pH 8.8, 10 mM MgC12, 17 mM (NH4)2S04. 200 ug/ml gelatine and 2.5 units Taq 
polymerase in a final volume of 50 ul. The temperature regime was 95 'C for 1.5 min, 25 'C for 1.5 min and 
72 *C for 3 min (However a range of PCR conditions could be used). The reaction products were extracted 

40 with phenol/chloroform, precipitated with ethanol and resuspended in 10 mM Tris. HCI and 0.1 mM EDTA, 
pH 8.0. 

The products from the PCR were digested with Psti and BstEII and purified on a 1.5% LGT agarose gel 
in Tris acetate buffer using Geneclean (Bio 101, LaJolla). The gel purified band was ligated into 
pSW2HP0LY (Figure 19). (This vector is related to pSW2 except that the body of the VHLYS gene has 

45 been replaced by a polylinker.) The vector was first digested with BstEII and PstI and treated with calf- 
intestinal phosphatase. Aliquots of the reaction mix were used to transform E. coll BMH 71-18 to ampicillin 
resistance. Colonies were selected on ampicillin (100 ug/ml) rich plates containing glucose at 0.8% w/v. 

Colonies resulting from transfection were picked in pools of five into two 96 well Corning microtitre 
plates, containing 200 ul 2 x TY medium and 100 ul TY medium, 100 ug/ml ampicillin and 1% glucose. 

50 The colonies were grown for 24 hours at 37 * C and then cells were washed twice in 200 ul 50 mM NaCI, 
pelleting the cells in an lEC Centra-3 bench top centrifuge with microtitre plate head fitting. Plates were 
spun at 2,500 rpm for 10 min at room temperature. Cells were resuspended in 200 ul 2 x TY, 100 ug/ml 
ampicillin and 1 mM IPTG (Sigma) to induce expression, and grown for a further 24 hr. 

Cells were spun down and the supernatants used in ELISA with lysozyme coated plates and anti- 

55 idiotypic sera (raised in rabbits against the Fv fragment of the D1 .3 antibody). Bound anti-idiotypic serum 
was detected using horse radish peroxidase conjugated to anti-rabbit sera (ICN Immunochemicals). Seven 
of the wells gave a positive result in the ELISA. These pools were restreaked for single colonies which were 
picked, grown up. induced in microtitre plates and rescreened in the ELISA as above. Positive clones were 
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grown up at the 50 ml scale and expression was induced. Culture supernatants were purified as in Example 
5 on columns of lysozyme-Sepharose and eluates analysed on SDS-PAGE and staining with Page Blue 90 
(BDH). On elution of the column with diethylamine, bands corresponding to the VHLYS mutant domains 
were identified, but none to the VKLYS domains. This suggested that although the mutant domains could 

5. bind to lysozyme, they could no longer associate with the VKYLS domains. 

For seven clones giving a positive reaction In ELISA, plasmids were prepared and the VKLYS gene 
excised by cutting with EcoRI and religating. Thus the plasmids should only direct the expression of the 
VHLYS mutants. 1.5 ml cultures were grown and induced for expression as above. The cells were spun 
down and supernatant shown to bind lysozyme as above. (Alternatively the amplified mutant VKLYS genes 

10 could have been cloned directly into the pSW1 HPOLY vector for expression of the mutant activities in the 
absence of VKLYS.) 

An ELISA method was devised in which the activities of bacterial supernatants for binding of lysozyme 
(or KLH) were compared. Rrstly a vector was devised for tagging of the VH domains at its C-terminal region 
with a peptide from the c-myc protein which is recognised by a monoclonal antibody 9E10. The vector was 
75 derived from pSW1 by a BstEll and Smal double digest, and ligation of an oligonucleotide duplex made 
from 

5* GTC ACC GTC TCC TCA GAA CAA AAA CTC ATC TCA GAA GAG GAT 
CTG AAT TAA TAA 3 • and 

51 ipij.^ ijiip^ Qj^Q p^fj^Q ^j,Q rj^rj^Q rpQ^ Q^rp qj^q ipipp ^rjiQ fj^Q ijtQj^ 

GGA GAC G 3 '. 

25 The VHLYSMYC protein domain expressed after induction was shown to bind to lysozyme and to the 9E10 
antibody by ELISA as follows: 

(1) Falcon (3912) flat bottomed wells were coated with 180 ul lysozyme (3 mg/ml) or KLH (50 ug/ml) per 
well in 50 mM NaHC03, pH 9.6, and left to stand at room temperature overnight; 

(2) The wells were washed with PBS and blocked for 2 hrs at 37 • C with 200 ul 2% Sainsbury's instant 
30 dried skimmed milk powder in PBS per well; 

(3) The Blocking solution was discarded, and the walls washed out with PBS (3 washes) and 150 u I test 
solution (supernatant or purified tagged domain) pipetted into each well. The sample was incubated at 
37-Cfor2 hrs; 

(4) The test solution was discarded, and the wells washed out with PBS (3 washes). 100 al of 4 ag/ml 
35 purified 9E10 antibody in 2% Sainsbury's instant dried skimmed milk powder in PBS was added, and 

incubated at 37'C for 2 hrs: 

(5) The 9E10 antibody was discarded, the wells washed with PBS (3 washes). 100 ul of 1/500 dilution of 
anti-mouse antibody (peroxidase conjugated anti-mouse Ig (Dakopats)) was added and incubated at 
37 • C for 2 hrs; 

40 (6) The second antibody was discarded and wells washed three times with PBS; and 

(7) 100 ul 2,2'azino-bis(3-ethylbenzthiazolinesulphonic acid) [Sigma] (0.55 mg/ml. with 1 ul 20% 
hydrogen peroxide: water per 10 ml) was added to each well and the colour allowed to develop for up to 
10 minutes at room temperature. 

The reaction was stopped by adding 0.05% sodium azide in 50 mM citric acid. pH 4.3. ELISA plates 

45 were read in an Titertek Multiscan plate reader. 

The activities of the mutant supernatants were compared with VHLYS supernatant by competition with 
the VHLYSMYC domain for binding to lysozyme. The results show that supernatant from clone VHLYS- 
MUT59 is more effective than wild type VHLYS supernatant in competing for VHLYSMYC. Furthermore, 
Western blots of SDS-PAGE aliquots of supernatant from the VHLYS and VHLYSMUT59 domain (using anti- 

50 Fv antisera) indicated comparable amounts of the two samples. Thus assuming identical amounts of VHLYS 
and VHLYSMUT59, the affinity of the mutant appears to be greater than that of the VHLYS domain. 

To check the affinity of the VHLYSMUT59 domain directly, the clone was grown at the 11 scale and 
200-300 ug purified on lysozyme-Sepharose as in Example 5. By fluorescence quench titration of samples 
of VHLYS and VHLYSMUT59. the number of binding sites for lysozyme were determined. The samples of 

55 VHLYS and VHLYSMUT59 were then compared in the competition ELISA with VHLYSMYC over two orders 
of magnitude. In the competition assay each microtitre well contained a constant amount of VHLYSMYC 
(approximately 0,6 ug VHLYSMYC). Varying amounts of VHLYS or VHLYSMUT59 (3.8 uM in lysozyme 
binding sites) were added (0.166 - 25 ul). The final volume and buffer concentration in all wells was 
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constant. 9E10 (antl-myc) antibody was used to quantitate bound VHLYSMYC in each assay well. The % 

inhibition of VHLYSMYC binding was calculated for each addition of VHLYS or VHLYSMUT59, after 

subtraction of background binding. Assays were carried out in duplicate. The results indicate that VHLYS- 

MUT59 has a higher affinity for lysozyme than VHLYS. 
5 The VHLYSMUT59 gene was sequenced (after recloning into M13) and shown to be identical to the 

VHLYS gene except for the central residues of CDR3 (Arg-Asp-Tyr-Arg). These were replaced by Thr-Gln- 

Arg-Pro: (encoded by ACACAAAGGCCA). 

A library of 2000 mutant VH clones was screened for lysozyme and also for KLH binding (toothpicking 

5 colonies per well as described in Example 6). Nineteen supernatants were identified with lysozyme 
10 binding activities and four with KLH binding activities. This indicates that new specificites and improved 

affinities can be derived by making a random repertoire of CDR3. 

Example 8 

75 Construction and expression of double domain for lysozyme binding. 

The finding that single domains have excellent binding activities should allow the construction of strings 
of domains (concatamers). Thus, multiple speciticlties could be built into the same molecule, allowing 
binding to different epitopes spaced apart by the distance between domain heads. Flexible linker regions 
20 could be built to space out the domains. In principle such molecules could be devised to have exceptional 
specificity and affinity. 

Two copies of the cloned heavy chain variable gene of the D1.3 antibody were linked by a nucleotide 
sequence encoding a flexible linker 
Gly-Gly-Gly-Aia-Pro-Ala-Ala-Ala-Pro-Ala-Gly-Gly-Gly- 

25 (by several steps of cutting, pasting and site directed mutagenesis) to yield the plasmid pSW3 (Rgure 20). 
The expression was driven by a lacz promoter and the protein was secreted into the periplasm via a pelB 
leader sequence (as described in Example 5 for expression of pSW1 and PSW2). The protein could be 
purified to homogeneity on a lysozyme affinity column. On SDS polyacrylamide gels, it gave a band of the 
right size (molecular weight about 26,000). The protein also bound strongly to lysozyme as detected by 

30 ELISA (see Example 5) using anti-idiotypic antiserum directed against the Fv fragment of the D1.3 antibody 
to detect the protein. Thus, such constructs are readily made and secreted and at least one of the domains 
binds to lysozyme. 

■ Example 9 

35 

Introduction of cysteine residue at C-terminal end of VHLYS 

A cysteine residue was introduced at the C-terminus of the VHLYS domain in the vector pSW2. The 
cysteine was introduced by cleavage of the vector with the restriction enzymes BstI and Smal (which 
40 excises the C-terminal portion of the J segment) and ligation of a short oligonucleotide duplex 
5' GTC ACC GTC TCC TCA TGT TAA TAA 3' and 
5' TTA TTA ACA TGA GGA GAC G 3'. 

By purification on an affinity column of lysozyme Sepharose it was shown that the VHLYS-Cys domain was 
expressed in association with the VKLYS variable domain, but the overall yields were much lower than the 
45 wild type Fv fragment. Comparison of non-reducing and reducing SDS polyacrylamide gels of the purified 
Fv-Cys protein indicated that the two VH-Cys domains had become linked through the introduced cysteine 
residue. 

Example 10 

50 

Linking of VH domain with enzyme 

Linking of enzyme activities to VH domains should be possible by either cloning the enzyme on either 
the N-terminal or the C-terminal side of the VH domain. Since both partners must be active, it may be 
55 necessary to design a suitable linker (see Example 8) between the two domains. For secretion of the VH- 
enzyme fusion, it would be preferable to utilise an enzyme which is usually secreted. In Figure 21 , there is 
shown the sequence of a fusion of a VH domain with alkaline phosphatase. The alkaline phosphatase gene 
was cloned from a plasmid carrying the E. coli alkaline phosphatase gene in a plasmid pEK48 [51] using 
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the polymerase chain reaction. The gene was amplified with the primers 

5» CAC CAC GGT CAC CGT CTC CTC ACG GAC ACC AGA AAT GCC TGT 
5 TCT G 3' and 

5 » GCG AAA ATT CAC TCC CGG GCG CGG TTT TAT TTC 3 • • 

The gene was introducecl into the vector pSW1 by cutting at BstEII and Smal. The construction (Figure 21) 
10 was expressed in E. coli strain BMH71-18 as in Example 5 and screened for phosphatase activity using 1 
mg/ml p-nitrophenylphosphate as substrate in 10mM diethanolamine and 0.5 mM MgCI^, pH 9.5) and also 
on SDS polyacrylamide gels which had been Western blotted (detecting with antl-idiotypic antiserum). No 
evidence was found for the secretion of the linked VHLYS-alkaline phosphatase as detected by Western 
blots (see Example 5), or for secretion of phosphatase activity. 
75 However when the construct was transfected into a bacterial strain BL21DE3 [52] which is deficient in 
proteases, a band of the correct size (as well as degraded products) was detected on the Western blots. 
Furthermore phosphatase activity could now be detected in the bacterial supernatant. Such activity Is not 
present in supernatant from the strain which had not been transfected with the construct. 

A variety of linker sequences could then be introduced at the BstEII site to improve the spacing 
20 between the two domains. 

Example 11 

Coexpression of VH domains with Vk repertoire 

25 

A repertoire of Vx genes was derived by PGR using primers as described in Example 2 from DNA 
prepared from mouse spleen and also from mouse spleen mRNA using the primers VK3F0R and 
VK2BACK and a cycle of 94*C for 1 min. 60 'C for 1 min, 72 'C for 2 min. The PGR amplified DNA was 
fractionated on the agarose gel, the band excised and cloned into a vector which carries the VHLYS domain 
30 (from the D1 .3 antibody), and a cloning site (Sac! and Xhol) for cloning of the light chain variable domains 
with a myc tail (pSWIVHLYS-VKPOLYMYG, Figure 22). 

Glones were screened for lysozyme binding activities as described in Examples 5 and 7 via the myc 
tag on the light chain variable domain, as this should permit the following kinds of Vx domains to be 
identified: 

36 (1) those which bind to lysozyme in the absence of the VHLYS domain; 

(2) those which associate with the heavy chain and make no contribution to binding of lysozyme: and 

(3) those which associate with the heavy chain and also contribute to binding of lysozyme (either helping 
or hindering). 

This would not identify those Vx domains which associated with the VHLYS domain and completely 
40 abolished its binding to lysozyme. 

In a further experiment, the VHLYS domain was replaced by the heavy chain variable domain VH3 
which had been isolated from the repertoire (see Example 6). and then the Vx domains cloned into the 
vector. (Note that the VH3 domain has an internal Sad site and this was first removed to allow the cloning 
of the Va repertoire as Sacl-Xhol fragments.) 
45 By screening the supernatant using the ELISA described in Example 6, bacterial supernatants wilt be 
identified which bind lysozyme. 

Example 12 

50 High expression of VH domains. 

By screening several clones from a VH library derived from a mouse immunised with lysozyme via a 
Western blot, using the 9E10 antibody directed against the peptide tag, one clone was noted with very high 
levels of expression of the domain (estimated as 25 - 50 mg/l). The clone was sequenced to determine the 
55 nature of the sequence. The sequence proved to be closely related to that of the VHLYS domain, except 
with a few amino acid changes (Figure 23). The result was. unexpected, and shows that a limited number of 
amino acid changes, perhaps even a single amino acid substitution, can cause greatly elevated levels of 
expression. 


25 


EP 0 368 684 B1 


By making mutations of the high expressing domain at these residues, it was found that a single amino 
acid change in the VHLYS domain(Asn 35 to His) is sufficient to cause the domain to be expressed at high 
levels. 


5 CONCLUSION 

It can thus be seen that the present invention enables the cloning, amplification and expression of 
heavy and light chain variable domain encoding sequences in a much more simple manner than was 
. previously possible. It also shows that Isolated variable domains or such domains linked to effector 
70 molecules are unexpectedly useful. 

It will be appreciated that the present invention has been described above by way of example only and 
that variations and modifications may be made by the skilled person without departing from the scope of 
the invention. 
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Claims 

1. A method of cloning sequences (target sequences) each containing a sequence encoding at least part 
15 of an immunoglobulin variable domain, which method comprises providing a sample repertoire of 

nucleic acid containing target sequences, and using fonArard and back primers in the process of 
copying and cloning of the target sequences for expression of a repertoire of proteins each comprising 
at least part of an immunoglobulin variable domain, the forward primer being specific for a sequence at 
or adjacent the 3' end of the sense strand of each of the target sequences, the back primer being 
20 specific for a sequence at or adjacent the 3' end of the antisense strand of each of the target 
sequences. 

2. A method according to claim 1 which method comprises: 

(a) providing a sample repertoire of double-stranded nucleic acid containing target sequences; 
25 (b) causing the two strands of the doubled-stranded nucleic acid to be separated; 

(c) annealing to the sample a fon^rard and a back oligonucleotide primer, the forward primer being 
specific for a sequence at or adjacent the 3* end of the sense strand of each of the target 
sequences, the back primer being specific for a sequence at or adjacent the 3' end of the antisense 
strand of each of the target sequences, under conditions which allow the primers to hybridize 

30 specifically to the nucleic acid; 

(d) treating the annealed sample with a DNA polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which cause primer extension to take place, thereby 
producing double-stranded nucleic acid; 

(e) repeating steps (b) to (d), thereby producing some double-stranded DNA (product DNA) 
35 containing only the target sequences; 

(f) cloning product DNA into expression vectors for expression of a repertoire of proteins each 
comprising at least part of an immunoglobulin variable domain. 

3. A method according to claim 2 wherein steps (b) to (d) are repeated a plurality of times before step (f). 

40 

4. A method according to claim 1 , which comprises: 

(a) providing a repertoire of mRNA; 

(b) annealing to the mRNA an oligonucleotide primer specific for a sequence at or adjacent the 3' 
end of each of the target sequences on the sense strands, under conditions which allow the primer 

45 to hybridize specifically to the nucleic acid; 

(c) treating the primer-annealed mRNA with a polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which cause primer extension to take place, thereby 
producing antisense cDNA; 

(d) annealing to the cDNA an oligonucleotide primer specific for a sequence at or adjacent the 3' 
50 end of each of the target sequences on the antisense strands, under conditions which allow the 

primer to hybridize specifically to the nucleic acid; 

(e) treating the primer-annealed cDNA with a polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which cause primer extension to take place, thereby 
producing double-stranded DNA (product DNA); 

55 (f) cloning product DNA into expression vectors for expression of a repertoire of proteins each 

comprising at least part of an immunoglobulin variable domain. 

5. A method according to claim 4 wherein, after step (e) the following steps are performed before step (f): 


27 


EP 0 368 684 B1 


(i) causing the two strands of the product DNA to be separated; 

(H) annealing to the separated strands a forward and a back oligonucleotide primer, the forward 
primer being specific for a sequence at or adjacent the 3* end of the sense strand of each of the 
target sequences, the back primer being specific for a sequence at or adjacent the 3* end of the 
antisense strand of each of the target sequences, under conditions which allow the primers to 
hybridize specifically to the nucleic acid; 

(ill) treating the annealed sample with a DNA polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which cause primer extension to take place, thereby 
producing double-stranded nucleic acid. 

A method according to any one of claims 1 to 5 wherein the back primer is specific for a sequence at 
or adjacent the 3' end of the antisense strand of the sequences which are contained in the target 
sequences and which each encode at least part of an immunoglobulin variable domain. 

75 7. A method according to any one of claims 1 to 6 wherein the sample repertoire of double-stranded 
nucleic acid is derived from lymphocytes. 

8. A method according to claim 7 wherein the lymphocytes are derived from an animal or human 
mounting an immune response to an antigen. 

20 

9. A method according to claim 7 wherein the lymphocytes are derived from a patient with an auto- 
immune disease. 

10. A method according to claim 1 wherein the sample repertoire of nucleic acid is derived from rearranged 
25 immunoglobulin variable region genes. 

11. A method according to claim 1 wherein the sample repertoire of nucleic acid is genomic DNA. 

12. A method according to claim 1 wherein the sample repertoire of nucleic acid is derived from 
30 unrearranged immunoglobulin variable region genes. 

13. A method according to any one of claims 1 to 12 wherein the target sequence contains a sequence 
encoding a variable domain from an immunoglobulin heavy chain. 

35 14. A method according to claim 13 wherein the product DNA is inserted into an expression vector for 
expression of single domain ligands selectable by their binding affinity for antigen. 

15. A method according to any one of claims 1 to 13 wherein product DNA is inserted into an expression 
vector for expression of antibodies or antibody fragments selectable by their binding affinity for antigen. 

40 

16. A method according to any one of claims 1 to 13 wherein the product DNA is inserted Into an 
expression vector for expression alone. 

17. The method of any one of claims 1 to 13 wherein the product DNA is inserted into an expression vector 
45 for expression in combination with a complementary variable domain. 

18. A method according to any one of claims 1 to 13 wherein the product DNA is inserted into an 
expression vector already containing sequences encoding one or more constant domains for expres- 
sion. 

50 

19. A method according to any one of claims 1 to 13 wherein the product DNA is Inserted Into an 
expression vector for expression as fusion proteins. 

20. A method according to any one of claims 1 to 13 wherein the product DNA is inserted into an 
55 expression vector for expression with peptide tags. 

21. A method according to any one of claims 1 to 13 wherein product DNA containing sequences encoding 
at least immunoglobulin heavy chain variable domains and product DNA containing sequences 


10 


6. 
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encoding at least immunoglobulin light chain variable domains are inserted into expression vectors for 
expression of a combinatorial repertoire of complementary variable domains. 

22. A method according to claim 21 wherein the product DNA Is Inserted into an expression vector already 
5 containing sequences encoding one or more constant domains for expression. 

23. A nnethod according to claim 21 wherein product DNA is inserted into expression vectors for expression 
as fusion proteins. 

70 24. A method according to claim 21 wherein the product DNA is inserted into an expression vector for 
expression with peptide tags. 

25. A method according to any one of claims 1 to 24 wherein the forward and back primers are provided as 
single oligonucleotides. 

76 

26. A method according to any one of claims 1 to 24 wherein the forward primers are supplied as a mixture 
of oligonucleotides. 

27. A method according to any one of claims 1 to 24 wherein the back primers are supplied as a mixture of 
20 oligonucleotides. 

28. A method according to any one of claims 1 to 27 wherein each primer Includes a sequence encoding a 
restriction enzyme recognition site. 

25 29. A method according to claim 28 wherein the restriction enzyme recognition site is located in the 
sequence which is annealed to the nucleic acid. 

30. A method according to claim 1 wherein the back primer is a general primer useful for cloning a desired 
antibody specificity from a specific species. 

30 

31. A method according to claim 1 wherein the back primer is a mixture of primers having a variety of 
sequences designed to be complementary to the various families of VH, Vk or V sequences. 

32. An expression library comprising a repertoire of nucleic acid sequences each encoding at least part of 
35 an immunoglobulin variable domain. 

Patentanspriiche 

1. Verfahren zum Klonieren von Sequenzen (Zielsequenzen). die jeweils eine Sequenz enthalten, die fur 
40 zumindest einen Tell einer variablen Immunoglobulindomane kodieren, welches Verfahren die Schaf- 

fung einer Probensammlung von Zielsequenzen enthaltenden Nuklelnsauren und die Verwendung von 
Vorwarts- und Ruckwarts-Primern beim Verfahren zum Kopieren und Klonieren der Zielsequenzen zur 
Expression einer Sammlung von Proteinen umfaBt, von denen jedes zumindest einen Tell einer 
variablen Immunoglobulindomane aufweist, wobei der Vonrtrartsprimer fCir eine Sequenz am oder im 
45 Bereich des 3*-Endes des "sense "-Strangs einer jeden der Zielsequenzen spezifisch ist und der 
RGckwartigsprlmer fGr eine Sequenz am oder im Bereich des 3'-Endes des "antlsense"-Strangs einer 
jeden der Zielsequenzen spezifisch ist. 

2. Verfahren nach Anspruch 1 , welches Verfahren umfaCt: 

50 (a) das Schaffen einer Probensammlung von Zielsequenzen enthaltenden doppelstrangigen Nuklein* 

sSuren; 

(b) das Bewirken der Trennung der beiden Strange der doppelstranglngen Nukleinsaure; 

(c) das Aniagern eines Vorwarts- und eines Ruckwartsoligonukleotidprimers an die Probe, wobei der 
Vorwartsprimer fOr eine Sequenz am oder im Bereich des 3*-Endes des "sense "-Stranges einer 

55 jeden der Zielsequenzen spezifisch ist und der Ruckwartsprimer fOr eine Sequenz am oder im 

Bereich des 3*-Endes des "antisense'-Stranges einer jeden der Zielsequenzen spezifisch ist, unter 
Bedingungen, die das spezifische Hybridisieren der Primer an die Nukleinsaure ermoglichen; 
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(d) das Behandein der angelagerten Probe mit einem DNA-Polymeraseenzym In Gegenwart eines 
Desoxynukieosidtriphosphats unter Bedingungen, die bewirken, daB Primerextension stattfindet, 
wodurch doppelstrangige Nukleinsaure erzeugt wird; 

(e) das Wiederholen der Schritte (b) bis (d), wodurch etwas doppelstrangige DNA (Produkt-DNA) 
5 erzeugt wird, die nur die Zielsequenzen enthalt; 

(f) das Klonieren von Produkt-DNA in Expressionsvektoren zur Expression einer Sammlung von 
Proteinen, diejweils zumindest einen Tell einer variablen Immunoglobullndomgine umfassen. 

3. Verfahren nach Anspruch 2, worin die Schritte (b) bis (d) vor Schritt (f) mehrere Male wiederholt 
70 werden. 

4. Verfahren nach Anspruch 1 , welches umfafit: 

(a) das Schaffen einer nnRNA-Sammlung; 

(b) das Anlagern eines Oligonukleotidprimers an die mRNA, der fur eine Sequenz am Oder im 
75 Bereich des 3'-Endes einer jeden der Zielsequenzen an den "sense"-Str3ngen spezifisch 1st, unter 

Bedingungen, die die spezifische Hybridislerung des Prinners an die Nukleinsaure ermoglichen; 

(c) das Behandlung der primerangelagerten mRNA mit einem Polymeraseenzym in Gegenwart von 
Desoxynukleosldtriphospaten unter Bedingungen, die bewirken, daB Primerextension stattfindet. 
wodurch "antisense"-cDNA erzeugt wird: 

20 (d) das Anlagern eines Oligonukleotidprrmers an die cDNA, der fur eine Sequenz am oder 

angrenzend am 3'-Ende einer jeden der Zielsequenzen an den "antisense"-Strangen spezifisch ist, 
unter Bedingungen, die die spezifische Hybridisierung des Primers an die Nukleinsaure ermogli- 
chen; 

(e) das Behandein der primerangelagerten cDNA mit einem Polymeraseenzym in Gegenwart von 
25 Desoxynukleosldtriphosphaten unter Bedingungen. die bewirken, dafl Primerextension stattfindet. 

wodurch doppelstrangige DNA (Produkt-DNA) erzeugt wird; 

(f) das Klonieren von Produkt-DNA in Expressionsvektoren zur Expression einer Sammlungen von 
Proteinen, die jeweils zumindest einen Teil einer variablen Immunoglobulindomane umfassen. 

30 5. Verfahren nach Anspruch 4, worin nach Schritt (e) die folgenden Schritte vor Schritt (f) durchgefuhrt 
werden: 

(i) das Bewirken der Trennung der beiden Strange der Produkt-DNA; 

(ii) das Aniagem eines Von^Srts- und eines RUckwSrtsoligonukleotidprimers an die getrennten 
Strange, wobei der Vorwarlsprimer fur eine Sequenz am oder im Bereich des 3*-Endes des 

35 "sense "-Stranges einer jeden der Zielsequenzen spezifisch ist und der Ruckwartsprimer fur eine 

Sequenz am oder Im Bereich des 3'-Endes des "antisense "-Stranges einer jeden der Zielsequenzen 
spezifisch ist, unter Bedingungen. die das spezifische Hybridisieren der Primer an die Nukleinsaure 
ermoglichen; 

(ill) das Behandein der angelagerten Probe mit einem DNA-Polymeraseenzym in Gegenwart von 
40 Desoxynukleosldtriphosphaten unter Bedingungen, die bewirken, daB Primerextension stattfindet. 

wodurch doppelstrangige Nukleinsaure erzeugt wird. 

6. Verfahren nach einem der AnsprQche 1 bis 5, worin der RQckwMrtsprimer fUr eine Sequenz am oder im 
Bereich des 3'-Endes des "antisense"-Stranges der Sequenzen spezifisch Ist. die in den Zielsequenzen 

45 enthalten sind und von denen jede fur zumindest einen Teil einer variablen Immunoglobulindomane 
kodiert. 

7. Verfahren nach einem der AnsprGche 1 bis 6, worin die Probensammlung von doppelstrangigen 
Nuklelnsauren von Lymphozyten abgeleitet ist. 

50 

a Verfahren nach Anspruch 7, worin die Lymphozyten von einem Tier oder Menschen stammen, der eine 
Immunreaktion auf ein Antigen zeigt. 

9. Verfahren nach Anspruch 7, worin die Lymphozyten von einem Patienten mit einer Autoimmunerkran- 
55 kung stammen. 

10. Verfahren nach Anspruch 1. worin die Nukleinsaurenprobensammlung von umorientierten variablen 
Immunoglobulindomanengenen abgeleitet ist. 


30 


EP 0 368 684 B1 


11. Verfahren nach Anspruch 1, worin die Nuklelnsaureprobensammlung genomlsche DNA ist. 

12. Verfahren nach Anspruch 1, worin die Nukleinsaureprobensammlung von nicht umorientierten variablen 
Imnnunoglobulindomanengenen abgeleitet ist. 

5 

13. Verfahren nach einem der Anspruche 1 bis 12. worin die Zielsequenz eine Sequenz enthalt, die fur 
eine variable DonnSne der schweren Kette eines Immunoglobulins kodlert. 

14. Verfahren nach Anspruch 13, worin die Produkt-DNA in einen Expressionsvektor zur Expression von 
10 Einzeldomanenliganden eingesetzt wird, die nach ihrer Bindungsaffinitgit fur Antigen auswahlbar sind. 

15. Verfahren nach einem der AnsprUche 1 bis 13, worin Produkt-DNA In einen Expressionsvektor zur 
Expression von Antikorpern oder AntikSrperfragmenten eingesetzt wIrd, die nach ihrer Bindungsaffinltat 
fur Antigen auswahlbar sind. 

15 

16. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA In einen Expressionvektor nur 

zur Expression eingesetzt wird. 

17. Verfahren nach einem der Anspruche 1 bis 13. worin die Produkt-DNA In einen Expressionvektor zur 
20 Expression in Komblnation mit einer komplement^ren variablen DomSne eingesetzt wird. 

18. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA in einen Expressionvektor 
eingesetzt wird, der bereits Sequenzen enthalt, die fUr eine oder mehrere konstante DomSnen zur 
Expression kodieren. 

25 

19. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA In einen Expressionvektor zur 
Expression als Fusionsproteine eingesetzt wird. 

20. Verfahren nach einem der Anspruche 1 bis 13 worin die Produkt-DNA in einen Expressionsvektor zur 
30 Expression mit Peptidanhangsein eingesetzt wird. 

21. Verfahren nach einem der Anspruche 1 bis 13, worin Produkt-DNA, die Sequenzen enthalt, die 
zumindest fOr variable Domanen derschweren Kette von Immunoglobulinen kodieren, und Produkt-DNA, 
die Sequenzen enthalt, die zumindest fur variable Domanen der leichten Kette von Immunoglobulinen 

35 kodieren, zur Expression einer Kombinationssammlung von komplementaren variablen Domanen in 
Expresslonvektoren eingesetzt wird. 

22. Verfahren nach Anspruch 21, worin die Produkt-DNA zur Expression in einen Expressionvektor 
eingesetzt wird, der bereits Sequenzen enthSlt. die fUr eine oder mehrere konstante DomSnen kodieren. 

40 

23. Verfahren nach Anspruch 21 , worin Produkt-DNA in Expresslonsvektoren zur Expression als Fusions- 
proteine eingesetzt wird. 

24. Verfahren nach Anspruch 21, worin Produkt-DNA in einen Expressionsvektor zur Expression mit 
45 Peptidanhangsein eingesetzt wird. 

25. Verfahren nach einem der Anspruche 1 bis 24, worin die VorwSrts- und RQckwartsprlmer als einzelne 
, Oligonukleotide vorhanden sind. 

50 26. Verfahren nach einem der Anspruche 1 bis 24. worin die Vorwartsprimer als eine Mischung aus 
Oligonukleotiden zugefUhrt werden. 

27. Verfahren nach einem der Anspruche 1 bis 24, worin die RQckwartsprlmer als eine Mischung aus 
Oligonukleotiden zugefQhrt werden. 

55 

28. Verfahren nach einem der AnsprOche 1 bis 27, worin jeder Primer eine Sequenz umfaflt. die fOr eine 
Restriktlonsenzymerkennungsstelle kodlert. 
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29. Vertahren nach Anspruch 28, worin die Restriktionsenzymerkennungsstelle sich in der Sequenz 
befindet, die an die Nukieinsaure angelagert 1st. 

30. Verfahren nach Anspruch 1, worin der Ruckwartsprimer ein allgemeiner Primer 1st, der zum Klonieren 
5 einer gewUnschten AntikorperspezifitMt von einer spozifischen Spezies nUtzlich ist. 

31. Verfahren nach Anspruch 1, worin der RUckwartsprlmer eine l\1lschung aus Primern mit einer Vielzahl 
von Sequenzen ist, die so konstruiert sind, da3 sie komptementSr zu den verschiedenen Fannillen von 
VH-, Vk- Oder V-Sequenzen sind. 

10 

32. Expressionskollektion, die eine Sammlung von Nukleinsauresequenzen umfaBt. von denen jede fur 
zumindest einen Toil einer variablen ImmunoglobulindomSne kodiert. 

Revendications 

75 

1. Methode de clonage de sequences (sequences cibles), chacune contenant une sequence codant au 
moins une partie d'un domaine variable d'immunoglobuiine , laqueile methode consiste k produire un 
repertoire d'echantillons de sequences cibles contenant de I'acide nucleique et a utiliser des amorces 
vers i'avant et vers I'arriere dans le processus de copiage et de clonage des sequences cibles pour 
20 I'expression d'un repertoire des proternes, chacune comprenant au moins une partie d'un domaine 
variable d'immunoglobuline , Tamorce vers Tavant etant specifique d'une sequence a ou pres de 
Textremite 3' du brin de sens de chacune des sequences cibles. Tamorce vers I'arriere §tant specifique 
d'une sequence h ou adjacente k rextr§mit^ 3' du brin anti-sens de chacune des sequences cibles. 

25 2. Methode selon la revendicatton 1 . laqueile consiste k: 

(a) pr^voir un repertoire d'echantillons de sequences cibles contenant de I'acide nucleique a deux 
brins ; 

(b) forcer les deux brins de I'acide nucleique k deux brins k se s^parer ; 

(c) recuire, sur f'echantillon ,une amorce d'ollgonucliotides vers I'avant et vers Tarri^re. I'amorce 
30 vers I'avant etant specifique d'une sequence a ou a proximite de I'extremite 3' du brin de sens de 

chacune des sequences cibles, ramorce vers I'arrilre etant specifique d'une sequence k ou 
adjacente a I'extremite 3' du brin de I'anti-sens de chacune des sequences cibles, dans des 
conditions qui permettent aux amorces de d'hybrider specifiquement k I'acide nucleique ; 

(d) traiter Techantillon recuit avec une enzyme d'ADN polymerase en presence de desoxynucleoside 
35 triphosphates dans des conditions qui provoquent Textension de I'amorce, pour ainsi produire I'acide 

nucleique a deux brins ; 

(e) repeter les etapes (b) a (d) pour ainsi produire de I'ADN a deux brins (ADN produit) ne contenant 
' que les sequences cibles ; 

(f) doner I'ADN produit dans des vecteurs d'expression pour I'expression d'un repertoire de 
40 prot^ines dont chacune comprend au moins une partie d'un domaine variable d'immunoglobuline . 

3. Methode selon la revendlcation 2, ou les etapes (b) a (d) sont repetees un certain nombre de fois avant 
I'etape (f). 


45 4. Methode selon la revendlcation 1 , qui consiste k : 

(a) produire un repertoire d'ARNm ; 

(b) recuire, sur I'ARNm, une amorce d'oligonucleotides specifique d'une sequence a ou adjacente a 
rextr^mite 3* de chacune des sequences cibles sur les brins de sens, dans des conditions qui 
permettent k I'amorce de s'hybrider specifiquement a I'acide nucleique ; 

50 (c) traiter I'ARNm recuit k I'amorce par une enzyme polymerase en presence de desoxynucleoside 

triphosphates dans des conditions qui provoquent I'extension de I'amorce pour ainsi produire I'ADNc 
anti-sens ; 

(d) recuire, k i'ADNc, une amorce d'oligonucleotides specifique d'une sequence k ou adjacente k 
I'extremite 3' de chacune des sequences cibles sur les brins anti-sens, dans des conditions qui 

55 permettent k I amorce de s'hybrider specifiquement k I'acide nucleique ; 

(e) traiter I'ADNc recuit k I'amorce par une enzyme polymerase en presence de desoxynucleoside 
triphosphates dans des conditions qui provoquent I'extension de I'amorce, pour ainsi produire de 
I'ADN k deux brins (ADN produit) ; 
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(f) doner I'ADN produit dans des vecteurs d'expression pour I'expression d*un repertoire de 
prot^ines. chacune comprenant au moins une partie d'un domaine variable d'immunoglobuline . 

5. Methode selon la revendication 4 ou, apres I'etape (e), on accomplit, avant I'etape (f), les etapes 
5 suivantes : 

(i) provoquer la separation des deux brins de I'ADN produit; 

(li) reculre, aux brins s^par§s, une amorce d'oligonucl^otides vers Tavant et vers I'arriere, Tamorce 
vers I'avant ^tant sp^cifique d'une sequence h ou adjacente k Textr^mlt^ 3' du brin de sens de 
chacune des sequences cibles, I'amorce vers I'arriere etant specifique d'une sequence ou 
10 adjacente h I'extr^mit6 3' du brin d'anti-sens de chacune des sequences cibles, dans des conditions 

qui permettent aux amorces de s'hybrlder sp^cifiquement h Tacide nucl^ique ; 
(Hi) traiter Techantillon recuit avec une enzyme d'ADN polymerase en presence de d^spxynucleosi- 
de triphosphates dans des conditions qui provoquent une extension de i'amorce pour ainsi produire 
f'acide nucl^ique a deux brins. 

15 

6. Methode selon I'une quelconque des revendications 1 a 5, ou I'amorce vers I'arriere est specifique 
d'une sequence h ou adjacente a I'extremlte 3' du brin d'antl-sens des sequences qui sont contenues 
dans les sequences cibles et dont chacune code au molns une partie d'un domaine variable 
d'immunoglobuline . 

20 

7. Methode selon I'une quelconque des revendications 1 a 6, oCi le repertoire d'echantillons de I'acide 
nucleique a deux brins est derive de lymphocytes. 

8. Methode selon la revendication 7, ou les lymphocytes sont derives d'un animal ou humain montrant 
25 une r^ponse Immune h un antigene. 

9. Methode selon la revendication 7, ou les lymphocytes sont derives d'un patient presentant une maladie 
auto-immune. 

30 10. Methode selon la revendication 1, ou le repertoire d'echantillons de I'acide nucleique est derive de 
g^nes de regions variables d'immunoglobuline qui sont r^arrang^es. 

11. Methode selon la revendication 1, oCi le repertoire d'echantillons de I'acide nucieique est I'ADN 
gSnomique. 

35 

12. Methode selon la revendication 1. ou le repertoire d'echantillons de I'acide nucleique est derive de 
genes de regions variables d'immunoglobuline qui ne sont pas rearrangees. 

13. Methode selon Tune quelconque des revendications 1 a 12. ou la sequence clble contient une 
40 sequence codant un domaine variable d'une chatne lourde d'immunoglobuline. 

14. Methode selon la revendication 13. ou I'ADN produit est Insere dans un vecteur d'expresslon pour 
I'expression de ligands de domaine simple pouvant etre seiectionnes par leur affinite de liaison pour 
I'antig&ne. 

45 

15. Methode selon I'une quelconque des revendications 1 ^ 13, oO I'ADN produit est insere dans un 
vecteur d'expression pour I'expression des anticorps ou des fragments d'antrcorps pouvant Stre 
seiectionnes par leur affinite de liaison pour rantigfene. 

50 16. Methode selon Tune quelconque des revendications 1 h 13, ou I'ADN produit est insere dans un 
vecteur d'expresslon pour I'expression seule. 

17. Methode selon I'une quelconque des revendications 1 ^ 13. ou I'ADN produit est insere dans un 
vecteur d'expression pour I'expression en combinaison avec un domaine variable compiementaire. 

55 

18. Methode selon I'une quelconque des revendications 1 Si 13. ou I'ADN produit est insere dans un 
vecteur d'expression contenant dejk des sequences codant un ou plusieurs domalnes constants pour 
I'expression. 
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19. Methode selon Tune quelconque des revendications 1 a 13, ou TADN produit est insere dans un 
vecteur d'expression pour Texpression en tant que proteines de fusion. - 

20. Methode selon Tune quelconque des revendications 1 a 13, ou I'ADN produit est insere dans un 
5 vecteur d'expression pour Texpression avec des marqueurs de peptides. 

21. M^tliode selon I'une quelconque des revendications 1 h 13, ou TADN produit contenant des sequences 
codant au moins des domaines variables de chaTne lourde d'immunoglobuline et TADN produit 
contenant des sequences codant au moins des domaines variables de chaTne legere d'lmmunoglobuli- 

10 ne sont inseres dans des vecteurs d'expression pour Texpression d'un repertoire en combinaison de 
domaines variables compl§mentalres. 

22. Methode selon la revendicatlon 21 , ou TADN produit est Insure dans un vecteur d'expression contenant 
deja des sequences codant un ou plusieurs domaines constants pour I'expression. 

75 

23. Methode selon ia revendication 21 . ou I'ADN produit est Ins^r^ dans des vecteurs d'expression pour 
I'expression en tant que proteines de fusion. 

24. methode selon la revendication 21. ou I'ADN produit est insere dans un vecteur d'expression pour 
20 {'expression avec des marqueurs de peptides. 

25. Methode selon I'une quelconque des revendications 1 a 24, ou les amorces vers Tavant et vers I'arriere 
sont produites sous la forme d'oligonucleotides simples. 

25 28. m§thode selon Tune quelconque des revendications 1 h 24, oCi les amorces vers I'avant sont fournies 
sous ia forme d'un melange d'oligonucleotides. 

27. Methode selon I'une quelconque des revendications 1 k 24, oD les amorces vers I'arriere sont fournies 
sous la forme d'un melange d'oligonucleotides. 

30 

28. Methode selon Tune quelconque des revendications 1 k 27, ou chaque amorce contient une sequence 
codant un site de reconnaissance d'une enzyme de restriction. 

. 29. Methode selon la revendication 28, ou le site de reconnaissance d'une enzyme de restriction est placS 
35 dans la sequence qui est recuite a I'acide nucleique. 

30. Methode selon la revendication 1. ou I'amorce vers I'arriere est une amorce generale utile pour doner 
une specificite d'ariticorps souhaite d'une espece sp^cifique. 

40 31. Methode selon ia revendication 1, oO I'amorce vers I'arridre est un melange d'amorces ayant une 
variete de sequences designees pour etre complementaires aux diverses families des sequences VH. 
Vk ou V. 

32. Librairie d'expression comprenant un repertoire de sequences d'acides nucl6iques dont chacune code 
45 au moins une partie d'un domaine variable d'immunoglobuline. 


50 
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M13 VBPCaML. 

HixiD 

M!XIIATGAATATGCAAATCCTCTG?ATCrACATTOTAAATATA^ 

10 20 30 40 50 60 

CAAACAGAAAAAOVTGAGATC^CAGTTCTCrcTACAGTTACTGAGCACAC^^ 

70 80 90 100 110 120 


MGWSCIILFLVATAT 
CATGGGATGGAGCTGTATCATCCTClTCirrGGTAGCAACAGCTAC^^ 

130 140 150 160 170 180 


AGTAGCAGGCTTGAGGTCTGGACATATATATX^GGTGACAATGACATCC^CTTrG^ 

190 200 210 220 230 240 

PstI 

1 51 10 

GVHSQVQLQESGPG L V R P 
TCTCCACAGGTGTCCACTCCXyiGGTCCAAClGCASGAGAGOGGTC 

250 260 270 280 290 . 300 

CDRl 

15 20 25 30 

SQTLSLTCTVSGSTFSSrWM 
CTAGCXAGACCCTGAGCCTGACCTGO^CCGTGrcTGGCAGCACCTTCAGCAGCT^^ 

310 320 330 340 350 360 

CDR2 

35 40 45 50 

HWVRQPPGR GLEW IGRXDP N 
TGCACTGGGTGAGACAGCCACCTGGACGAGGTCTOX^AGTGGATTGG^ 

370 380 390 400 410 420 

55 60 65 70 

SGGTKYNEKFKSKVTMlaVDT 
ATAGTGGTGGTACTAAGTACAATGAGAAGTTCAAGAGCAGAGTGACAATGCTGGTA^ 

430 440 450 460 470 480 

75 80 85 90 

SKNQFSLRLSSVTAADTAVY 
CCAGCAAGAACCAGTTCAGCCTGAGACTCAGCAGCGTGACAGCCGCCGACACCGCGGTCT 
490 500 510 520 530 540 

CDR3 - 

95 100 105 110 

YCARYD YYGSSYFDYyiGQGr 
ATTATTGTGCAAGATACGATTACTACGGTAGTAGCTACTTTGACTACTGGGGCCAAG^ 
550 560 570 580 590 600 


BstEII 
115 I 120 
T V T V S S 
CX^ACiSGICaCCGTCTCCTCAGGTGAGrcCTTAa^ 

610 620 630 640 650 660 

AGATTTTACTGCATTTGTTGGGGGGGAAATGTGTGTATCTGAATTT^ 

670 680 690 700 710 720 

CTAGGGACACCTTGOyVGTCAGAAAGGGTCATTGGGAGCCCXS^ 

730 740 750 760 770 780 

BamHI 
I 

TCCrCAQCTCCrAGACTTCATGGCCAGAGATTTATAS Crm O 

790 800 810 r I O. J 
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M13 VKPCRl 

HinD III 

MSOIATGAATATGCAAATCCTCTGAATCTACATGGK^^ 

38 48 58 68 78 88 


CAAACAGAW^CATGAGATCACAGTTCTCTCTACAGTTACTGAGCA^ 

98 108 118 128 138 148 


MGWSCIILFLVATAT 
CATGGGATGGAGCTGTATCATCCTCTTCTTGGTAGCAACAGCTACAGGTAAG^ 

158 168 1*78 188 198 208 


AGTAGCAGGCTTGAQGTCTGGACATATATATGCKTrGACAATGACATC^^ 

218 228 238 248 258 268 


Pvu II 

I 

1 5 10 

GVHSDIQLTQSPSSLSAS 

TCTCCACAGGTGTCCACTCCGACATCCafilSCa^ 

278 288 298 308 318 328 


CDRl 

15 20 25 30 

VGDRVTITC R' A SGNIHNYLA 
GOslGQGTGACAGAGTGACCATCACCTGTAGAGCCAGCGGTAACATCCAC^ 

338 348 358 368 378 388 

CDR2 

35 40 45 50 

WYQQKPGKAPKLLiyyrrrL 

CTTGGTACCAGCAGAAGCO^TAAGGCTCCAAAGCTQCTGATC^ 

398 408 418 428 438 448 


55 60 65 70 

A DGVPSRFSGSGSGTDFTFT 
TGGCTGACGGTGTGCCAAGCAGATTCAGOSGTAGCGGTAGCGGTACCGACTTCAC^ 

458 468 478 488 498 508 

CDR3 

75 80 85 90 

ISSLQPEDIATYYC<?i?rW5r 
CCATO^GCAGCCTCCAGCCa^GAGGACATOGCCACCTACia^ 

518 528 538 548 558 568 


Bel I (requires dam" host) 
I 

95 100 105 108 

PRTFGQGTKVVIKR 
CCCCAAGGACGTTCGGCCAAGGGACCAAGGTGGTGAlCAAAanxy^G^ 

578 588 598 608 618 628 


BamHI 
I 

TTGCTTCcTCAcrissAic^ crm c 

638 648 rlO. D 
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Sequence of MBrl VH 

Splice -1 
1 G V H S 
AGGTGTCCACTCC 

1 PstI 10 20 

QVQLQESGTELASPGASVTL 

CAGGTCCfiACTfy^AGGAGTCAGG AAC^ 
VHIBACK SITE 

30 CDRl 40 

S C K A S' G Y T F T I D H I InI W V K K R 
TCCTGOUVGGCTTCTGGCTACAaVTTTACrGACCATATTATAAATTGGGTA^ 

^ fi2a 53 CDR2 

PGQGLEWIG Ifi ;^ y ? V ? g V 
CCTGGACAGGGCCTTGAGTGGATTGGAAGGATTTATCCAGTAAGTGGTGTAACTAACTAC 

60 CXIR2 65 70 

i N O K F M Gl KATFSVDRSSNTVY 
AATCAAAAATTCATGGGCAAGGCCACATTCTCT^^ 

80 82A B C 83 90 r^Si. 

MVLNSLTSEDPAVYYCGR |g ^1 
ATGGTGTTGAACAGTCTGACATCTGAGGACCCTGCTGTCTATTACTGTGGAAGGGGCTTT 

CDR3 1 03 BstEII Splice 

Id f d TI w g q g t t v t v s s i 

flATTTTGACTAC TGGGGCCAAGGGACCACGGTCACCGTCTCCTCAG GT 

VHIFOR SITE 


Sequence of MBrl VK 

Splice -1 
4 G V H S 
AGGTGTCCACTCC 

1 PvuII 10 20 

D I Q L TQSPPSLTVSVGERVT 
GACATTCAHFlX^nrrj^CTCCA CCATCCCTG^^ 
VKIBACK SITE 

, 27A B q D E F ^Rl 

ISC IKSNONLLWSG NRR Y C L GI 
ATCAGTTGCAAATCCAATCAGAATCTTTTATGGMSTGGAAACCGAAGGTACTGTTTGGGC 
35 40 50 CDR2 

, WHQWKPGQTPTP LIT IW T S D Rl 
TGGCACCAGTGGAAACCAGGGCAAACTCCTACACCGTTGATCACCTGGACATCTGATAGG 

60 70 

I F Si GVPDRFIGSGSVTDFTLT 
TTCTCTGGAGTCCCTGATCGTTTCATAGGCAGTGGATCTGTGACAGATTTCACTCTGACC 
80 90 CDR3 

ISSVQAEDVAVYFCQ I Q H L D H 
ATCAGCAGTGTGCAGGCTGAAGATGTGGCAGTTTATTTCTGTCAGCAACATTTGGACCTT 

95 100 Bgill/Bcll Splice 

I P Y TI F G G G T K L E I K | 
CCGTACACGTTCGGAGG GGGGACCAAGCTGGRGATCAAACG TGAG 
VKIFOR SITE 


FIG. 6 
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a-Lys 30 



BaiiiHI 


0,9 KBglll/BamHI) 


FIG. 7 
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a-Lys 17 



12.27 kb 


hygromycin 


(BamHI) 



2.9 

FIG. 8 


Hindm PstI BstEII BamHI 


BamHI 



FIG, 9 


Hindm PnoiII BamHI 



41 


EP 0 368 684 B1 


FRl 


FR2 


KABAT lA 

A07 PGLVKPSQSLSLTCSVTGYSIT SGYYWN WIRCFPGNKLEWMG 

A09 PGLVKPSQSLFLTCSITGFPIT SGYYWI WIRQSPGKPLEWMG 

E03 PGLVKPSQSLSLTCSVTGYSir SGYYWN WIRCFPGNKLEWMG 

GOl PGLVKPSQSLSLTCSVTGYSIT SGYYWN WIRCFPGNKLEWMG 


YISYDGSNNYNPSLKN 
YITHSGETFYNPSLQS 
YISYDGSNNYNPSLKN 
YISYDGSNNYNPSLKN 


KABAT IB 


A06 

25G0-7 

BC3 

G03 

H09 

25C10 

A12 

AOS 

2SG08 

A03 

C07 

H04 


PVLVAPSQSLSITCAVSDFSLT NYGVL WVRQPPGKGLEWLG VIWAGGITNYNSALMS 

PGLVQPSQSLSITCTVSGFSLT SYGVH WVRQSPGKGLEWLG VIWSGGSTDYNAAFIS 

PGLVAPSQSLSITCTVSGFSLT SYGVD WVRQPPGKGLEWLG VIWGGGSTNYNSAlilS 

PGLVQPSQSLSITCTVSGFSLT SYGVH WVRQSPGKGLEWLG VIWSGGSTDY^3AAFIS 

PVLVAPPQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEWLG VIWAGGSTNYNSALMS 

PGLVAPSQSLSITCTVSGFSLT SYAIS WVRQPPGKGLEWLG VIWTGGGTNYNSALKS 

PGLVAPSQSLSITCTVSGFSLT SYAIS WVRQPPGKGLEWU5 VIWTGGGTNYNSALKS 

PGLVAPSQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEW ** * * * * ^GSTTYSSALKS 

PGLVAPSQSLSITCTVSGFSLT SYDVD WVRQSPGKGLEWLG VIWGGGSTNYNSALKS 

PGLVQPSQSLSITCTVSGFSLT SYGVH WVRQSPGKGLEWLG VIWSGGSTDYNAAFIS 

PVLVAPSQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEWLG VIWAGGSTNYNSALMS 

PGLVAPSQSLSITCTVSGFSLT SYGVD WVRQSPGKGLEWLG VIWGVGSTNYNSALKS 


KABAT IIA 


£04 
H07 


PELVRPGVSVKI SCKGSGYTFT 
PELVRPGVSVKI SCKGSGYTFT 


DYAMH 
DYAMH 


WVKQSHAK5LEWIG 
WVKQSHAK5LEWIG 


VISTYYGDASYNQKFKD 
VISTYYGDASYN®<FKD 


KABAT IIB 

A02 AELVMPGASVKLSCKASGYTFT SYWMH 

B04 AELVKPGASVKMSCKASGYTFT SYWIT 

COS AELVKPGASVKLSCKASGYTFT SYWMH 

CO 9 AELVKPGASLKLSCKASGYTFT SYWMH 

DO 6 ASLVKPGASVKMSCKASGYTFT SYWIT 

DOS PELVKPGASVKLSCKASGYTFT SYWMH 

E07 AELVRPGASVKLSCKASGYTFT DYEMH 

GOB PELVKPGASVKISCKASGYTFT DYYIN 

GIO AELVKPGASVKVSCKASGYTFT SYWMH 

25G09 AELVKPGASVKMSCKASGYTFT TYPIE 

F04 TELVKPGASVKLSCKASGYTFT SYWMH 

H02 AELVKPGASVKLSCKASGYTFT SYWMH 

HOI AELVMPGASVKLSCKASGYTFT SYWMH 

25C05 PELVRPGTSVKMSCKASGYTFF ^3YWMK 

BOl AELVKPGASVKMSCKASGYTFT SYWIT 

805 AELVRPGSSVKLSCKDSYFAFM RHAKK 

Bll* AELVKPGASVKMSCKASGYTFT SYWIT 


WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGRGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
WVKQTPVHGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
WVKQNHGKSL2WIG 
WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGQGLEWIG 
^*QRPGOGLEWIG 
WVKQRPGQGLEWIG 
WVKQRPGHGLEWIG 
WVKCa^POQGLEWIG 


EIDPSDSYTNYNQKFKG 
DIYPGSGSTNYNEKFKS 
RIDPNSGGTKYNEKFKS 
EINPSNGGTNYDEKFKS 
DIYPGSGSTNYNEKFKS 
EINPSNGGTNYNEKPKS 
AIDPETGGTAYNQKFKG 
WIYPGSGNTKYNEKFKG 
RIHPSDSDTNYNQKFKG 
NFHPYNDDTKYNEKFKG 
NINPSNGGTNYNQKTKG 
NIDPSDSETHYNQKFKD 
EIDPSDSYTNYNWQG 
QIFPASGSIYYNEMHKD 
DIYPGSGSTNYNEKFKS 
SFTMYSDATEYSENFKG 
DIYPGSGSTNYNEKFKS 


KABAT III A 

25G05 GGLVQAWGSLSLSCAASGFTFT DYYMS 

CIO GGLVQPGGSLSLSCAASGFTFT DYYMN 

BO 7 GGLVQPGGSLSLSCAASGFTFT DYYMS 


WVRQPPGKALEWLG 
WVRQPPGKALEWLA 
WVRQPPGKALEWLA 


FIRNKANGYTTEYSASVKG 
LIRHKANGYTHEYSA5VKS 
LIRNKANGYTTEYSASVKG 


KABAT III B 

G05 GGLVKPGGSLKLSCAASGFTFS DYGNH WVRQAPEKGLEWVA 

812 GGLVQPGESLKLSCESNEYEFP SHDMS WVR***'*****Vn 

D04 GGLVQPGGSLRLSCAASGFTFS SYAMS WJPi'APGKGLENVS 

DOS GGLVQPGGSLRLSCAASGFTFS SYAMS WVA *APGXGL£WVS 

Fi2 GGLVQPGESWKLSCVIQQ**** »»*tt WVPQ*PEKRLELVA 

FOG GGLVQPGGSLRLSCAASGFTFS SYAMS *APGKGLEWVS 

D02 GGLVQPGESLKLSCESNEYVIP ^HDMS WVRQDSGE^LELVA 

F09 GDLVKPGGSLKLSCAASGFTFS SYGMS WVRQTPDKRLEWVA 


YISSGSSTIYYADTVKG 
AmSDGGSTYYPDTMER 
AISGSGGSTYTADSVKG 
AISGSGGSTYYADSVKG 
AINSDGGSTYYPDTMER 
AISGSGGSTYYADSAKC 
AINSDGGSTYYPDTMER 
TISSGGSYTYYPDSVKG 


KABAT III C 

E06 GGLVQPGGSMKLSCAASGFTFS 


DAWMD WVPQSPEKGLEWVA 


EIRNKANNHATYYAESVKG 


KABAT V A 

C04 AELVKPGASVKLSCKASGYTFT 


EYTIH WVKQRSGQGLEWIG 


FIG. 10a 


WFYPGSGSIKYNEKFKD 
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FR 3 


CDR 3 


RI SITRDTSKNQFFLKLNSVTTEDTATYYCAR EGNWDGFAY 

PISITRETSKNQFFLQLNSVTTEDTAMYYCAG DRDKLGPWFAY 

RISITRDTSKNQFFLQLNSVTTEDTATYYCAR DSSGSMDY 

RISITRDTSKNQFFLKLNSVTTEDTATYYCAR VSSGYESKDY 


RLSISKDTSKSQVFLKMNSLQTDDTAVYYCAK 
RLSISKDNSKSQVFFKMNSLQADDTAIYYCAR 
RLSI SKDN SKSQVFLKMNSLQTDDTAMYYCAK 
RLSISKDNSKSQVFFXMaSLQADDTAIYYCAR 
RLSISKDNSKSQVFLKMNSLQTDDTAMYYCAI 
RLSI SKDNSKSQVFLKMNSLQTDDTARYYCAR 
RLS I SKDNS KSQVF LKMNS LQTDDTARY YCAR 
RLSISKDNSKSQ'/FLKMNSLQTDDTAMYYCAR 
RLS I SKDN SKSQVFLKMNSLQTDDTAMYYCAR 
RLSISKDNSKSQVFFKMNSLOADDTAIYYCAR 
RLSISKDNSKSQVFLKMNSLQTDDTAMYYCAK 
BLSISKDNSKSQVFLKMNSLQTDDTAMYYCAS 


HGDSSGYFDY 

NDGYY 

LGRGYAMDY 

KRDYDYDRGYYYAMDY 

YYDGSFFAY 

EGYYYFAY 

lYYDGSSDYYAMDY 

13 nt. 

21 nt. 

28 nt. 

37 nt. 

32 nt. 


Ps .gene/Unproductb 

Unproductive 

Unproductive 

Unproductive 

Unproductive 


KATMTVDKSSSTAYMELARLTSEDSAVYYCAR 
KATMTVDKSSSTAYMELARLTSEDSAVYYCAR 


40 nt. 
22 nt. 


Unproductive 
Unproductive 


KATLTVDKSSSTAYMQLSSLTSEDSAVYYCVR RGLTYAM)Y 

KATLTVDTS SSTAYMQLSSLTSEDSAVYYCAR YY SNYFDY 

KATLTVDKPSSTAYMQLSSLTSEDSAVYYCAR PNWDHYYYGMDV 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCTL LYYYAMDY 

KATLTVDTS SSTAYMQLSSLTSEDSAVYYCAR SSGYDY 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCn GAARATNAY 

KATLTVDKS SSTAYMQLSSLTSEDSAVYYCAR GGFAY 

KATLTVDTS SSTAYMQLSSLTSEDSAVYYCAR SPMDY 

KATLTVDKS S STAYMQLS S LTSED SAVY YCAI EVPGGF YATDY 

KATLTVEKS S STVY LELSRLTSDD SAVYYCAR MDY YGS S LWFAY 
KATLTVDKSSSTAYMQLSSLTSEDSAVYYCAK ' TTWAFDY 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCAR KRDYSTYFDH 

FATLTVDKSSSTAYMOLSSLTSEDSAVYYCAP TGTEFAY 

. KAAmVDTSSSTAYMOLSSLTSEDTAVYFCL* 24 nt. 

KATLTVDKPSDTAYMQLSSLTSEDSASYYCAR 9 nt, 

KATLTANTSSSTAYMELSSLTSEDSAVYYCAR 23 nt. 

KATLTVDTSSSTSYMQLSSLTSEDSAVYYCAR 15 nt. 


Ps.gene 

Ps . gene/Unproductii 
Unproductive 
Unproductive 
Unproductive 


RFTISRDNSQSILYLQKNALRAEDSATYYCAR YMILGAMDY 
RFTISRDNSQSI LYLa^^NALRAEDSATYYCAR GYYYDGSYYAM)Y 
RFTISRDNSQSILYLQP^NALRAEDSATYYCAR 23 nt. 


Unproductive 


RFTISRDNAKNTLr LQMTSLRSEDTAMYYCAR AKFHLYFDY 

RFJISROm'KKTLYLGMSSLRSEDTALYYCAR REGWSSRLDGDV 

RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAD RGLHWFDP 

RFTISRDNSKNTLYLQMNSLRAEDTAmCAK RNYGSSPFDY 

RFIJSRDNSKKTLYL&iSSLRSEDTALYYCAR PPMt^SY 

RFTISRDNSKtn'LYLQMNSLPAEDTAVYYCAK 43 nt. 

RFIISRDNTKKTLYLQMSSLRSEDTALYYCAR 28 nt. 

RFTISRDNAKKTLYLQMSSLKSEDTAMYYCAR 35 nt. 


Ps . gene 
Ps . gene 
Ps.gene 
Ps . gene 

Ps . gene/Unproductl^ 
Ps . gene/Unproducti^ 
Unproductive 


RFTISRDDSKSRVYLQKNSLRAEDTGIYYCTG 30 nt. 


Unproductive 


KATLTADK5SSTVYMELSRLTSEDSAVYFCAR HEDRDSSGYAMDY 

FIG. 10 b 
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CPR 2 


EJWMaSWORK 3 


CPR 3 


KABAT HDHAN VHI 


STSTAYMELRSLRSEDTAWYCAR GEGWDUFDY 
, HAQKFQG RVTIRRHKSTSTAYMELSSLRSEDTAVYYCAR GSRYGYDCSGYYYL 
GYAQKFQG RVTMrRNTSISTATMELSSLRSEDTAVYYCAR LAHFSGSPVDWFDP 


XABAT HUMAN V22 


KHQLQPSLKS RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR GGVA/PAAIMDV 

KS RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR MABYYDFWSGYSAYYDY 

SLKS RLSISQDTSRNQFSLRLSSVTAADTAVYYCAR HRNWGSPVHFDY 

ESTSTAYMELSSLRSEDTAVYYCAR DSYGDYGGHY 


KABAT HUMAN VH3 


ISYITSSSSYTNYADSVKG 
SVKG 
YADSVKG 
YYADSVRD 
DSVKG 

VSAISGSGGSTYYADSVKG 
AVISYDGSNKYYADSVKG 
GAVISYDGSNKYYADSVKG 

QYAASVKG 


RFTISRDNAKNSLYLQMNSLRADDTAVYYCAR 
RFTISRDDSKS I AYLQVNSLKTEDTAVY YCTR 
RFTISRDNAKNSLFLQMSSLRAEDTAFYYCAR 
RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAK 
RFTISRDHAKNSLYLQMNSLRDEOTAVYYCAR 
RFTISRDNPKMTLYLQMNSLRSEDTAVYYCAR 
RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAR 
RFTISRDNSKNTLYLQMNSLRAEOTAVYYCAS 
AKNSLYLQMNSLRAEDTAVYYCVR 
RFTISRDDSKNSLYI^SLNTEDTAVYYCVR 


DGRFGTYSPSDY 
TIYYDSSGYPYW 
GIALDAFOI 

S3 NT. UNPROD REARR 

DHSGTGGGGSGSYF 

KDNLHFDP 

DLGGRGWWPAPGGR5 1 YY YGMDV 

LEGIGTIYYYGMDV 

DDSSSWPKHFQH 

SGVA^YLDY 


KNOWN FAMILY 


AVYYCAR DPRIAARPDYYYYMDV 
TAMYYCAR GAEWEPTARYYYGLNV 


FIG. 11 
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FR2 


YTFT SYGIS 

GEKPGSSVKVSCKASGYTFT DYFMN 

QVQLQEIGPRTGEASETLSLICAVSGDSI S SGNW* I 

aVQLQESGPGLVK*SETLSLTCTVSGGSIS SYYWS 

GYTFT NYCMH 

QVQLQESGPGLVKpSETLSLYCAVSGDSIS SGNW*I 

GPRLGEASETLSLTCTVSGGSI S SSS YYw 

QVQLQESGPGLVKpSETLSLTCTVSGGSI S SYYWS 

LSLICAVSGSSIS SGNW*I 

SETLSLTCAVYGGSFS GYYWS 

QVQLVQSGAEVKKPGASVKVSCKASGYrFT NYCMH 

SETLSLICAVSGDSI S SGNW* I 

SRAQTGEASETLSLTCTVSGGSI S SSS YYWG 

CPLTCTVSGGSVSSGS YYWS 

GLVKPSETLSLTCTVSGGSI S SYYWS 

SFETLSLICAVSGDSIS SGNW*I 

QVQLVQSGAEVKKPGSSVKVSCKASGGTFS SYAIS 

QVQLQWGAGLLKPSETLSLTCAVYGGSFS GYYWS 

QLQLQESGPGLVKPSETLSLTCTVSGGSIS SSSYYWG 

GPGLVKPSQTLSLTCTVSGGSIS SGGYYWS 


WVTTGPWTRDLBWMG 

WMRQAPGQRLEWMG 

WVRQPPGKGLEWIG 

WIrqppGKGLEWIG 

WVRQDHAQGLEWMG 

WVRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WVRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WVRQVLAQGLEWMG 

WVRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WIGSPpGKGLEWIG 

WVRQPPGKGLEWIG 

WVRQAPGQGLEWMG 

WIRQPPGKGLEWIG 

WIRQPPGKGLEWIG 

WIRQNPGKGLEWIG 


* indicates stop codon ( unsure as sequence remains in frame) 

• sequence termonates due to internal restriction site J 
lower case denotes frame shift I 


£sm 

WISAYNGNTNYAQKLQG 

WINAGNGNTKYSQKLQG 

EIHHSGSTYYNPSLKS 

RIYTSGSTNYNPSLKS 

LVCPSDGSTSYAQKFQA 

EIHHSGSTYYNPSLKS 

EINHSGSTNYNPSLKS 

YIYYSGSTNYNPSLKS 

EIHHSGSTYYNPSLKS 

EINHSGSTNYNPSLKS 

LVCPSDGSTSYAQKFQA 

EIHHSGSTYYNPSLKS 

SIYYSGSTYYNPSLKS 

YIYYSGSTNYNPSLKS 

RIYTSGSTNYNPSLKS 

EIHHSGSTYYNPSLKS 

RIIPILGIANYAQKFQG 

EINHSGSTNYNPSLKS 

EINHSGSTNYNPSLKS 

YIYYSGSTYYNPSLKS 


cDsa 

RVTMTTDTSTSTAYMSLRSLRSDDTAVYYCAR DTVSS 
RVTITRDTSASTAYMQLSSLRSEDTAVYYCAR DTVSS 
RITMSVDTSKNQFYLKLSS* 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 
RVTITRDTSMSTAYMELSSLRSEDTAMYYCAR DTVSS 
RITMSVDTSKNQFYLKLSS • 
RVTISVDTSKNQFSLKLSS • 
RVTISVDTSKNQFSLKLSS • 
RITMSVDTSKNQFYLKLSS* 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 
RVTITRDTS^4STAYMELSSLRSEDTAMYYCAR DTVSS 
RITMSVDTSKNQFYLKLSS • 
RVTIPVDTSKNQFSLKLSS • 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 
RVTMSVDTSKNQFSLKLSS • 
RITMSVDTSKNQFYLKLSS • 

RVTITADKSTSTAYMELSSLRSEDTAVYYCAR DTVS 

RVTISVDTSKNQFSLKLSS* 

RVTISVDTSKNQFSLKLSS* 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 


FIG. 12 
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pSWl 

Hindlll site AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 


AGLLLLAAQPAMAQVQLQES 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


GPGLVAPSQSLSITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLEW 
rCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWG DGNTDYNSAL KSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCrCTCAAATCCAGACTG 
250 260 270 280 290 300 


SI SKDNSKSQVFLKMNSLHT 
AGCATCAGCAAGGACAACTCCAAGAGCa^GTTTTCTTAAAAATGAACAGTCTGCA»^ 
310 320 330 340 350 360 


DDTARYYCARERD YRLDYWG 
GATCACA»GCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACT^ 

370 380 390 400 410 420 


QGTTVTVSS Smal 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTAlI£CGfiSC 

430 440 450 460 470 480 


FIG. 13 


46 


EP 0 368 684 B1 


pSW2 

Hlndlll AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCArAATGAAATACCTATTGCCTACGGCATC^ 
10 20 30 40 50 60 


AGLLLLAAQPAMAQVQLQES 
GCTGGATTGTrATTACTCGCTGCCCAACCAGCGATGGa:CAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


G P G L VAPSQSLSITCTVSGF 
GGACCTGGCCrGGTGGCGCCCrCACAGAGCCTGTCCATCA»TO:ACCGTCTCAGre 

130 140 150 160 170 180 


SLT GYGVNWVRQPPGKGLEW 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 .210. 220 230 240 


LGMIWGDGNTDYNSAL KSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATCCAGACTG 
250 260 270 280 290 300 


SIS KDNSKSQVFLKMNSLHT 
AQ(:».TCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCTGC^ 

310 320 330 340 350 360 


DDTARYYCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 380 390 400 410 420 


QGTTVTVSS 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTCGAATTCGCCAAGCTTGCATGC 
430 440 450 460 470 480 


MKYLLPTAAAG 
AAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGA 
• 490 500 510 520 530 540 


LLLLAAQPAMADIVLTQSPA 
TTGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGTCCTGACTCAGTCTCCAGCC 
550 560 570 580 590 600 


SLSASVGETVTITCRASGNI 
TCCCTTTCTGCGTCTGTGGGAGAAACTGTCACCATCACArGTCGAGCAAGTGGGAATATT 
610 620 630 640 650 660 


HNYLAWYQQKQGKSPQLLVY 
CACAATTATTTAGCATGGTATCTVGCAGAAACAGGGAAAATCTCCTCAGCTCXTGGTC^^ 
670 680 690 700 710 720 

FIG. 14 a 
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YTTTLADGVPSRFSGSGSGT 
TATACAACAACCTTAGCAGATGGTGTGCCATCAAGGTTCAGTGGCAGTGGATCAGGAACA 
730 740 750 760 770 780 


QY SL KINSLQP EDFGSYYCQ 
CAATATTCTCTCAAGATCAACAGCCTGa^CCTGAAGATTTTGGGAGTTATTACTGT(^ 
790 800 810 820 830 840 


HFWSTPRTFGGGTKLEI. KR 
CATTTTTGGAGTACTCCTCGGACGTTCGGTGGAGGCACCAAGCTGGAAATCA/UIC^ 

850 860 870 880 890 900 


TAAGAGCTCGAATTC 
910 

FIG. 14 b 


pSWlHPOLYMYC 
Hindlll Site AAGCTT 

MKYLLPTAA 
GCATGCAAATTCrATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 

AGLL LLAAQPAMAQVQLQ 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG 
70 80 90 100 110 PstI 

Poly linker 
TCTAGA GTCGAC CTC6AG 
Xbal Sail Xhpl 

MYC PEPTIDE 

V T V S S EQKL TflEED L N * * 

GGTCACCGTCTCCTCAGAACAAAAACTCATCTCAGAAGAGGATCTGAATTAATAA 

BStEII 

GGGCTAAGCTCGAATTC 

FIG. 15 
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VH3 QVQLQESGPELVKPGASVKMSCKASGYTFT 
VH8 QVQLQESGPELVKPGASVKMSCKASGYTFT 
VH-Dl . 3 QVQLKESGPGLVAP SQSLS ITCTVSGFSLT 


50 


CDR2 


VH3 
VH8 

VH-Dl. 3 


YINPYNDGTKYNEKFKG 
Y INP YNDGSK YNEKFKG 
MIW GDGNTDYNSALKS 


CPRl 

SYVMH 
SYVMH 
GYGVN 


I I f 


49 


WVKQKPGAGLEWI G 
WVKQKPGQGLEWIG 
WVRQPPGKGLEWLG 


94 


KATLTSDKSSSTAYMELSSLTSEDSAVYYCAV 
KATLTADKSSNTAYMQLSSLTSEDSAVYYCAR 
RLSISKDNSKSQVFLKMNSLHTDDTARYYCAR 


95 CDR3 


VH3 
VH8 

VH-Dl. 3 


LLLRYFFDY 

GAWSYYAMDY 

ERDYRLDY 


113 

WGQGTTVTVSS 
WGQGTTVTVSS 
WGQGTTLTVSS 


FIG. 16 


FR1 

QVQLQESGGGLVQPGGSLRLSCAASGFTFS 


SYAMS CDR1 

FR2 

WVRQAPGKGLEWVS 


AISGSGGSTYYADSVKG cdr2 

FR3 

RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAM 


WRGIATPVSFDLGYFDY cdr3 

FIG. 17 
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Psti 


BstEII 


-C 


Pst! 
J 


CDR3 


BstEII 


rearranged VH genes 
from immunised mouse 
spleen DNA 


VHD1.3 gene 


PGR amplify rearranged VH genes or 
VHD1 .3. Excise VH band from gel. 
Clone into vector for expression of VH 
domains in E.coli 


pSWIHPOLYMYC 



pSWIHPOLY 


repertoire of expressed repertoire of expressed 

VH domains from spleen VH domains with mutant 

CDR3 regions 

Assay for binding to antigen 

FIG. 18 
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pSW2HP0LY 
Hindi II AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 


AGLLLL*AA QPAMAQVQLQ 
GCTCGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGC3U3 
70 80 90 100 110 PstI 


TCTAGA GTCGAC CTC6AG 
Xbal Sail Xhol 

V T V S S 
GGTCACCGTCTCCTCATAATAAGAGCrCGAATTCGCCAAGCTTGCATGC 
BstEII 430 440 450 460 470 480 


MKYLLPTAAAG 
AAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCrGG^ 
490 500 510 520 530 540 


LLLLAAQPAMADIVLTQSPA 
TTGTTATTACTCGCTGCCO^CCAGCGATGGCCGACATCGTCCTGACTCAGTCTCCAG^^ 
550 560 570 580 590 600 


SLSASVGETVTITCRASGNI 
TCCCTTTCTGCGTCTGTGGGAGAAACTGTCACCATCACATGTCGAGCAAGTGGGAATATT 
610 620 630 640 650 660 


HNYLAWYQQKQGKSPQLLVY 
CACAATTATTTAGCATGGTATCAGCAGAAACAGGGAAAATCTCCTCAGCTCCTGGrCTO^ 
670 680 690 700 710 720 


YTTTLADGVPS RFSGSGSGT 
TATACAACAACCTTAGCAGATGGTGTGCCATCAAGGTTCAGTGG»GTGGATCAGGA^ 

730 740 750 760 770 780 


QYSLKINSLQPEDFGSYYCQ 
CAATATTCTCTCAAGATCAACAGCCTGCAACCTGAAGATTTTGGGAGTTATTACTGTCAA 
790 800 810 820 830 840 


HFWSrPRTFGGGTKLEIKR 
CAITTTTGGAGTACTCCTCGGACGTTCGGTGGAGGCACCAAGCTGGAAATCAAACGGTAA 
850 860 870 880 890 900 


TAAGAGCTCGAATTC 
910 

FIG. 19 
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M K Y L L P T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAA^^^ 

10 20 30 40 50 60 

AAAGLLLLAAQPA-MAQVQLQ 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG 
70 80 90 100 110 120 

ESGP GLVAPSQSLSITCTVS 
GAGTCAGGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACArGCACCGTCTCA 
130 140 150 160 170 180 

GFSLTGYGVNWVRQPPGKGL 
GGGTTCTCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTG 
190 200 210 220 230 240 

EWLGMIWGDGNTDYNSALKS 
GAGTGGCTGGiGAATGAITTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATC 
250 260 270 280 290 300 

RLSISKDNSKSQVFLKMNSL 
AGACTGAGCATCAGCAAGGACAACTCCAAGAGCaVAGTTTTCTTAAAAATGAACAGTCT^ 
310 320 330 340 - 350 360 

HTDDTARYYCARERDYRLDY 
CACACTGATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTAC 
370 380 390 400 410 420 

WGQGTTV TVSSGGGAPAAAP 
TGGGGCCAAGGCACCACGGTCACCGTCTCCTCAGGTGGTGGTGCTCCAGCAGCTGCACCT 
430 440 450 460 470 480 

AGGGQVQLKESGPGLVAPSQ 
GCTGGAGGAGGACAGGTGCAGCTGAAGGAGTCAGGACCTGGCCTGGTGGCGCCCTCACAG 
490 500 510 520 530 540 

SLSITCTVSGFSLTGYGVNW 
AGCCTGTCCATCACATGCACCGTCTCAGGGTTCrCATTAACCGGCTATGGTGTAAACTGG 
550 560 570 580 590 600 

VRQPPGKGLEWLGMIWGDGN 
GrrCGCCAGCCTCCAGGAAAGGGTCTGGAGTGGCTGGGAATGATTTGGGGTGATGGAAAC 
610 620. 630 640 650 660 

V 

TDYNSALKSRLSISKDNSKS 
ACAGACTATAATTCAGCTCTCAAATCCAGACTGAGCATCAGCAAGGACAACTCCAAGAGC 
670 680 690 700 710 720 

QVFLKMNSLHTDDTARYYCA 
CAAGTTTTCTTAAAAATGAACAGTCTGCACACTGATGACACAGCCAGGTACTACTGTGCC 
730 740 750 760 770 780 

RERDYRLDYWGQGTTVTVSS 
AGAGAGAGAGATTATAGGCTTGACTACTGGGGCCAAGGCACCACGGTCACCGTCTCCTCA 
790 800 810 820 830 840 

TAATAAGAGCTC 
850 

FIG. 20 
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MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 


AGLLLLAAQPAM AQ VQLQES 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


GPGIiVAPSQSLSITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLEW 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWGDGNTDY NSALKSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATCCAGACTG 
250 260 270 280 290 300 


SISKDNSKSQVFLKMNSLHT 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACT 
. 310 320 330 340 350 360 


DDTARYYCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 380 390 400 410 420 


QGTTVTVSSRTPEMPVLENR 
CAAGGCACCACGGTCACCGTCTCCTCACGGACACCAGAAATGCCTGTTCTGGAAAACCGG 
430 440 450 460 470 480 


AAQGDITAPGGARRL1GDQT 
GCTGCTCAGGGCGATATTACTGCACCCGGCGGTGCTCGCCGTTTAACGGGTGATCAGACT 
490 500 510 520 530 540 


AALRDSLSDKPAKNIILLIG 
GCCGCTCTGCGTGATTCTCTTAGCGATAAACCTGCAAAAAATATTATTTTGCTGATTGGC 
550 560 570 580 590 600 


DGMGDSEITAARNYAEGAGG 
GATGGGATGGGGGACTCGGAAATTACTGCCGCACGTAATTATGCCGAAGGTGCGGGCGGC 
610 620 630 640 650 660 


FFKGIDALPLTGQYTHYALN 
TTTTTTAAAGGTATAGATGCCTTACCGCTTACCGGGCAATACACTCACTATGCGCTGAAT 
670 680 690 700 710 720 


KKTGKPDYVTDSAASATAWS 
AAAAAAACCGGCAAACCGGACTACGTCACCGACTCGGCTGCATCAGCAACCGCCTGGTCA 
730 740 750 760 770 780 

FIG. 21a 
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TGVKTY NGALGVDIHEKDHP 
ACCGGTGTCAAAACCTATAACGGCGCGCTGGGCGTCGATATTCACGAAAAAGATCACCCA 
790 800 810 820 830 840 


TILEMAKAAGLATGNVS TAE 
ACGATTCTGGAAATGGCAAAAGCCGCAGGTCTGGCGACCGGTAACGTTTCTACCGCAGAG 
850 860 870 880 890 900 


LQDATPAALVAHVTSRKCYG 
TTGCAGGATGCCACGCCCGCTGCGCTGGTGGCACATGTGACCTCGCGCAAATGCTACGGT 
910 920 930 940 950 960 


PSATSEKCPG NALEKG GKG.S 
CCGAGCGCGACCAGTGAAAAATGTCCGGGTAACGCTCTGGAAAAAGGCGGAAAAGGATCG 
970 980 990 1000 1010 1020 


ITEQLLNARADVTLGGGAKT 
ATTACCGAACAGCTGCTTAACGCTCGTGCCGACGTTACGCTTGGCGGCGGCGCAAAAACC 
1030 1040 1050 1060 1070 1080 


FAETATAGEWQGKTLREQAQ 
TTTGCTGAAACGGCAACCGCTGGTGAATGGCAGGGAAAAACGCTGCGTGAACAGGCACAG 
1090 1100 1110 1120 1130 1140 


ARGYQLVSDAASLNSV TEAN 
GCGCGTGGTTATCAGTTGGTGAGCGATGCTGCCTCACTGAATTCGGTGACGGAAGCGAAT 
1150 1160 1170 1180 1190 1200 


Q QKP LLGLFADGNMPVRWLG 
CAGCAAAAACCCCTGCTTGGCCTGTTTGCTGACGGCAATATGCCAGTGCGCTGGCTAGGA 
1210 1220 1230 1240 1250 1260 


PKATYHGNIDKPAVTCTPNP 
CCGAAAGCAACGTAeCATGGCAATATCGATAAGCCCGCAGTCACCTGTACGCCAAATCCG 
1270 1280 1290 1300 1310 1320 


QRNDSVPTLAQMTDKAIELL 
CAACGTAATGACAGTGTACCAACCCTGGCGCAGATGACCGACAAAGCCATTGAATTGTTG 
1330 - 1340 1350 1360 1370 1380 


SKNEKGFFLQVEGASIDKQD 
AGTAAAAATGAGAAAGGCTTTTTCCTGCAAGTTGAAGGTGCGTCAATCGATAAACAGGAT 
1390 1400 1410 1420 1430 1440 


HAANPCGQI GETVDLDEAVQ 
CATGCTGCGAATCCTTGTGGGCAAATTGGCGAGACGGTCGATCTCGATGAAGCCGTACAA 
1450 1460 1470 1480 1490 1500 


RALEFAKKEGNTLVIVTADH 
CGGGCGCTGGAATTCGCTAAAAAGGAGGGTAACACGCTGGTCATAGTCACCGCTGATCAC 
1510 1520 1530 1540 1550 1560 


FIG. 21b 
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AHASQIVAPDTKAPGLTQAL 
GCCCACGCCAGCCAGATTGTTGCGCCGGATACCAAAGCTCCGGGCCTCACCCAGGCGCTA 
1570 1580 1590 1600 1610 1620 


NTKDGAVMVMSYGNSEEDSQ 
AATACCAAAGATGGCGCAGTGATGGTGATGAGTTACGGGAACTCCGAAGAGGATTCACAA 
1630 1640 1650 1660 1670 1680 


EHTGSQLRIAAYGPHAANVV 
GAACATACCGGCAGTCAGTTGCGTATTGCGGCGTATGGCCCGCATGCCGCCAATGTTGTT 
1690 1700 1710 1720 1730 1740 


GLTDQTDLFYTMKAAL GLK* 
GGACTGACCGACCAGACCGATCTCTTCTACACCATGAAAGCCGCTCTGGGGCTGAAATAA 
1750 1760 1770 1780 1790 1800 


AACCGCGCCCGGGAGTGAATTTTCGCTGCCGGGTGGTTTTTTTGCTGTTAGC 
1810 1820 1830 1840 1850 


FIG- 21c 


55 


EP 0 368 684 B1 


MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 


A GLLLLAAQPAMAQVQLQE S 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


GP-GLVAPSQSLS ITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLEW 
TCArrAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWGDGNTDYNSALKS RL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATCCAGACTG 
250 260 270 280 290 300 


SISKDN SKSQVFLKMNSLHT 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACT 
310 320 330 340 350 360 


DDTARYYCARERD YR LDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 380 390 400 410 420 


QGTTVTVS S** 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTATCCCGGGAGCTTGCATGCAAA 
430 440 450 460 470 480 


MKYLLPTAA A GL 
TTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTG 
490 500 510 520 530 540 


LLLAAQPAMADIELVDLEIK 
TTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGAGCTCGTCGACCTCGAGATCAAA 
550 560 570 580 590 600 


RE'QKLISE EDLN** 
CGGGAACAAAAACTCATCTCAGAAGAGGATCTGAATTAATAATGATCAAACGGTAATAAG 
610 620 630 640 650 660 


GATCCAGCTCGAATTC 
670 

FIG. 22 
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A 

QVQLQESGPGLVQPSQS LSI 
CAGGTGCAGCTGCAGGAGTCAGGACCTGGCCTAGTXXAGCCCTCACAGAGCOT^ 

10 20 30 40 50 60 

G N P 

TCTVSGFSLTSYGVHWVRQ S 
ACCTGCACAGTCTCTGGTTTCTCATTAACTAGCTATGGTGTACACTGGGTTCGCCAGTC^ 

C 

70 80 90 100 110 120 


PGKGLEWLGMIWGDGNTDYN 
CCAGGAAAGGGTCTGGAGTGGCTGGGAATGATTTGGGGTGATGGAAACACAGACTATAAT 
130 140 150 160 170 180 


SA LK SR LSISKDNSKSQVFL 
TO^GCTCTCAAATCCAGACTGAGCATCAGCAAGGACAACTCCAAGAGCCAAGTTCT 

190 200 210 220 230 240 


KMNSLHTDDTARYYCA RERD 
AAAATGAACAGTCTGCACACTGATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGAr 
250 260 270 280 290 300 


YRLDYWGQGTTVTVSS 
TATAGGCTTGACTACTGGGGCCAAGGGACCACGGTCACCGTCTCCTCA 
310 320 330 340 

FIG. 23 
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Description 

[0001] The present invention relates to cloning of immunoglobulin (Ig) variable domain sequences. Methods for clon- 
ing, amplifying and expressing DNA sequences encoding at least part of an immunoglobulin variable domain and 
5 methods for the use of said DNA sequences in the production of Ig-type molecules are disclosed. 

[0002] A list of references is appended to the end of the description. The documents listed therein are referred to in 
the description by number, which is given in square brackets []. 

[0003] The Ig superfamily includes not only the Igs themselves but also such molecules as receptors on lymphoid 
cells such as T lymphocytes. Immunoglobulins comprise at least one heavy and one light chain covalently bonded 

10 together. Each chain is divided into a number of domains. At the N terminal end of each chain is a variable domain. 
The variable domains on the heavy and light chains fit together to form a binding site designed to receive a particular 
target molecule. In the case of Igs, the target molecules are antigens. T-cetl receptors have two chains of equal size, 
the a and p chains, each consisting of two domains. At the N-terminal end of each chain is a variable domain and the 
variable domains on the a and P chains are believed to fit together to form a binding site for target molecules, in this 

IS case peptides presented by a histocompatibility antigen. The variable domains are so called because their amino acid 
sequences vary particularly from one molecule to another. This variation in sequence enables the molecules to rec- 
ognise an extremely wide variety of target molecules. 

[0004] Much research has been candied out on Ig molecules to determine how the variable domains are produced. 
It has been shown that each variable domain comprises a number of areas of relatively conserved sequence and three 
20 areas of hypervariable sequence. The three hypervariable areas are generally known as complementarity determining 
regions (CDRs). 

[0005] Crystallographic studies have shown that in each variable domain of an Ig molecule the CDRs are supported 
on framework areas formed by the areas of conserved sequences. The three CDRs are brought together by the frame- 
work areas and, together with the CDRs on the other chain, fomi a pocket in which the target molecule is received. 
25 [0006] Since the advent of recombinant DNA technology, there has been much Interest in the use of such technology 
to clone and express Ig molecules and derivatives thereof. This Interest is reflected in the numbers of patent applications 
and other publications on the subject. 

[0007] The eariiest v^ork on the cloning and expression of full Igs in the patent literature is EP-A-0 120 694 (Boss). 
The Boss application also relates to the cloning and expression of chimeric antibodies. Chimeric antibodies are Ig-type 
30 molecules in which the variable domains from one Ig are fused to constant domains from another Ig. Usually, the 
variable domains are derived from an Ig from one species (often a mouse Ig) and the constant domains are derived 
from an Ig from a different species (often a human Ig). 

[0008] A later European patent application, EP-A-0 125 023 (Genentech), relates to much the same subject as the 
Boss applicatiori, but also relates to the production by recombinant DNA technology of other variations of Ig-type 
35 molecules. 

[0009] EP-A-0 194 276 (Neuberger) discloses not only chimeric antibodies of the type disclosed in the Boss appli- 
cation but also chimeric antibodies in which some or ail of the constant domains have been replaced by non-lg derived 
protein sequences. For instance, the heavy chain CH2 and CH3 domains may be replaced by protein sequences 
derived from an enzyme or a protein toxin. 
40 [0010] EP-A-0 239 400 (Winter) discloses a different approach to the production of Ig molecules. In this approach, 
only the CDRs from a first type of Ig are grafted onto a second type of Ig in place of its normal CDRs. The Ig molecule 
thus produced is predominantly of the second type, since the CDRs fomri a relatively small part of the whole Ig. However, 
since the CDRs are the parts which define the specificity of the Ig, the Ig molecule thus produced has its specificity 
derived from the first Ig. 

45 [0011] Hereinafter, chimeric antibodies, CDR-grafted Igs, the altered antibodies described by Genentech, and frag- 
ments, of such Igs such as F(ab')2 and Fv fragments are referred to herein as modified antibodies. 
[0012] One of the main reasons for all the activity in the Ig field using recombinant DNA technology is the desire to 
use Igs in therapy. It Is well known that, using the hybridoma technique developed by Kohlerand Milstein, it is possible 
to produce monoclonal antibodies (MAbs) of almost any specificity. Thus, MAbs directed against cancer antigens have 

50 been produced. It is envisaged that these MAbs could be covalently attached or fused to toxins to provide "magic 
bullets" for use in cancer therapy. MAbs directed against nonnal tissue or cell surface antigens have also been pro- 
duced. Labels can be attached to these so that they can be used for in vivo imaging. 

[0013] The major obstacle to the use of such MAbs In therapy or in vivo diagnosis is that the vast majority of MAbs 
which are produced are of rodent, in particular mouse, origin. It is very difficult to produce human MAbs. Since most 
55 MAbs are derived from non-human species, they are antigenic in humans. Thus, administration of these MAbs to 
humans generally results in an anti-lg response being mounted by the human. Such a response can interfere with 
therapy or diagnosis, for instance by destroying or clearing the antibody quickly, or can cause allergic reactions or 
immune complex hypersensitivity which has adverse effects on the patient. 
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[0014] The production of modified Igs has been proposed to ensure that the Ig administered to a patient is as "human" 
as possible, but still retains the appropriate specificity. It is therefore expected that modified Igs will be as effective as 
the MAb from which the specificity is derived but at the same time not very antigenic. Thus, it should be possible to 
use the modified Ig a reasonable number of times in a treatment or diagnosis regime. 

5 [0015] At the level of the gene, it is known that heavy chain variable domains are encoded by a "rearranged" gene 
which is built from three gene segments: an "unrearranged" VH gene (encoding the N-terminal three framework regions, 
first two complete CDRs and the first part of the third CDR), a diversity (DH)-segment (DH) (encoding the central portion 
of the third CDR) and a joining segment (JH) (encoding the last part of the third CDR and the fourth framework region). 
In the maturation of B-cells, the genes rean-ange so that each unrearranged VH gene is linked to one DH gene and 

10 one JH gene. The rearranged gene corresponds to VH-DH-JH. This rearranged gene is linked to a gene which encodes 
the constant portion of the Ig chain. 

[0016] For light chains, the situation is similar, except that for light chains there is no diversity region. Thus light chain 
variable domains are encoded by an "unrearranged" VL gene and a JL gene. There are two types of light chains, kappa 
(k) or lambda (X), which are built respectively from unrearranged Vk genes and Jk segments, and from unrean^anged 
15 VA, genes and JX segments. 

[0017] It has been discovered that isolated Ig heavy chain variable domains can bind to antigen in a 1:1 ratio and 
with binding constants of equivalent magnitude to those of complete antibody molecules. 

[0018] Single domain llgands consisting of at least part of the variable domain of one chain of a molecule from the 
Ig superfamily may be the end product of processes involving methods according to the present invention. 

20 [0019] Preferably, each ligand consists of the variable domain of an Ig light, or, most preferably, heavy chain. 

[0020] If desired, a gene for a single domain ligand can be mutated to improve the properties of the expressed 
domain, for example to increase the yields of expression or the solubility of the ligand, to enable the ligand to bind 
better, or to Introduce a second site for covalent attachment (by Introducing chemically reactive residues such as 
cysteine and histidlne) or non-covalent binding of other molecules. In particular it would be desirable to introduce a 

25 second site for binding to serum components, to prolong the residence time of the domains in the serum; or for binding 
to molecules with effector functions, such as components of complement, or receptors on the surfaces of cells. 
[0021] Thus, hydrophobic residues which would normally be at the interface of the heavy chain variable domain with 
the light chain vahable domain could be mutated to more hydrophillc residues to Improve solubility; residues in the 
CDR loops could be mutated to improve antigen binding; residues on the other loops or parts of the |3-sheet could be 

30 mutated to introduce new binding activities. Mutations could include single point mutations, multiple point mutations 
or more extensive changes and could be introduced by any of a variety of recombinant DNA methods, for example 
gene synthesis, site directed mutagenesis or the polymerase chain reaction. 

[0022] Since these llgands have equivalent binding affinity to that of complete Ig molecules, the llgands can be used 
in many of the ways as are Ig molecules or fragments. For example, Ig molecules have been used in therapy (such 
35 as in treating cancer, bacterial and viral diseases), in diagnosis (such as pregnancy testing), in vaccination (such as 
in producing antl-ldiotypic antibodies which mimic antigens), in modulation of activities of hormones or growth factors, 
in detection, in biosensors and in catalysis. 

[0023] It is envisaged that the small size of the ligands may confer some advantages over complete antibodies, for 
example, in neutralising the activity of low molecular weight drugs (such as digoxin) and allowing their filtration from 
40 the kidneys with drug attached; in penetrating tissues and tumours; in neutralising viruses by binding to small conserved 
regions on the surfaces of viruses such as the "canyon" sites of viruses [16]; in high resolution epitope mapping of 
proteins; and in vaccination by ligands which mimic antigens. 

[0024] A single domain ligand may be linked to one or more of an effector molecule, a label, a surface, or one or 
more other ligands having the same or different specificity, forming a "receptor". 

45 [0025] A receptor comprising a ligand linked to an effector molecule may be of use in therapy. The effector molecule 
may be a toxin, such as ricin or pseudomonas exotoxin, an enzyme which Is able to activate a prodrug, a binding 
partner or a radio-Isotope. The radio-isotope may be directly linked to the ligand or may be attached thereto by a 
chelating structure which is directly linked to the ligand. Such ligands with attached isotopes are much smaller than 
those based on Fv fragments, and could penetrate tissues and access tumours more readily. 

50 [0026] A receptor comprising a ligand linked to a label may be of use in diagnosis. The label may be a heavy metal 
atom or a radio-isotope, In which case the receptor can be used for in vivo imaging using X-ray or other scanning 
apparatus. The metal atom or radio-isotope may be attached to the ligand either directly or via a chelating structure 
directly linked to the ligand. For in vitro diagnostic testing, the label may be a heavy metal atom, a radio-isotope, an 
enzyme, a fluorescent or coloured molecule or a protein or peptide tag which can be detected by an antibody, an 

55 antibody fragment or another protein. Such receptors would be used in any of the known diagnostic tests, such as 
ELISA or fluorescence-linked assays. 

[0027] A receptor comprising a ligand linked to a surface, such as a chromatography medium, could be used for 
purification of other molecules by affinity chromatography. Linking of ligands to cells, for example to the outer membrane 
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proteins of E. co// or to hydrophobic tails which localise the ligands in the cell membranes, could allow a simple diag- 
nostic test in which the bacteria or cells would agglutinate in the presence of molecules bearing multiple sites for binding 
the ligand(s). 

[0028] Receptors comprising at least two ligands can be used, for instance, in diagnostic tests. The first ligand will 
5 bind to a test antigen and the second ligand will bind to a reporter molecule, such as an enzyme, a fluorescent dye, a 
coloured dye, a radio-isotope or a coloured-, fluorescently- or radio-labelled protein. 

[0029] Alternatively, such receptors may be useful in increasing the binding to an antigen. The first ligand will bind 
to a first epitope of the antigen and the second ligand will bind to a second epitope. Such receptors may also be used 
for increasing the affinity and specificity of binding to different antigens in dose proximity on the surface of cells. The 

10 first ligand will bind to the first antigen and the second epitope to the second antigen: strong binding will depend on 
the co-expression of the epitopes on the surface of the cell. This may be useful in therapy of tumours, which can have 
elevated expression of several surface markers. Further ligands could be added to further improve binding or specificity. 
Moreover, the use of strings of ligands, with the same or multiple specificities, creates a larger molecule which Is less 
readily filtered from the circulation by the kidney. 

15 [0030] For vaccination with ligands which mimic antigens, the use of strings of ligands may prove more effective 
than single ligands, due to repetition of the immunising epitopes. 

[0031] If desired, such receptors with multiple ligands could include effector molecules or labels so that they can be 
used in therapy or diagnosis as described above. 

[0032] The ligand may be linked to the other part of the receptor by any suitable means, for instance by covalent or 
20 non-covalent chemical linkages. However, where the receptor comprises a ligand and another protein molecule, it is 

preferred that they are produced by recombinant DNA technology as a fusion product. If necessary, a linker peptide 

sequence can be placed between the ligand and the other protein molecule to provide flexibility. 

[0033] The basic techniques for manipulating Ig molecules by recombinant DNA technology are described In the 

patent references cited above. These may be adapted in order to allow for the production of single domain ligands and 
26 receptors by means of recombinant DNA technology. 

[0034] Preferably, where the ligand is to be used for in vivo diagnosis or therapy in humans, it is humanised, for 

instance by CDR replacement as described in EP-A-0 239 400. 

[0035] In order to obtain a DNA sequence encoding a ligand, it is generally necessary firstly to produce a hybridoma 
which secretes an appropriate MAb. This can be a very time consuming method. Once an immunised animal has been 
30 produced, it is necessary to fuse separated spleen cells with a suitable myeloma cell line, grow up the cell lines thus 
produced, select appropriate lines, redone the selected lines and reselect. This can take some long time. This problem 

also applies to the production of modified Igs. 

[0036] A further problem with the production of ligands, and also receptors described above and modified Igs, by 
recombinant DNA technology is the cloning of the variable domain encoding sequences from the hybridoma which 

35 produces the MAb from which the specificity is to be derived. This can be a relatively long method involving the pro- 
duction of a suitable probe, construction of a clone library from cDN A or genomic DNA, extensive probing of the clone 
library, and manipulation of any isolated clones to enable the cloning into a suitable expression vector. Due to the 
inherent variability of the DNA sequences encoding Ig variable domains, it has not previously been possible to avoid 
such time consuming work. It is therefore a further aim of the present invention to provide a method which enables 

40 substantially any sequence encoding an Ig superfamity molecule variable domain (ligand) to be cloned in a reasonable 
period of time. 

[0037] According to an aspect of the present Invention therefore, there is provided a method of cloning a sequence 
(the target sequence) which encodes at least part of the variable domain of an Ig superfamily molecule, which method 
comprises: 

45 

(a) providing a sample of double stranded (ds) nucleic acid which contains the target sequence; 

(b) denaturing the sample so as to separate the two strands; 

(c) annealing to the sample a forward and a back oligonucleotide primer, the forward primer being specific for a 
sequence at or adjacent the 3' end of the sense strand of the target sequence, the back primer being specific for 

50 a sequence at or adjacent the 3' end of the antisense strand of the target sequence, under conditions which allow 

the primers to hybridise to the nucleic acid at or adjacent the target sequence; 

(d) treating the annealed sample with a DNA polymerase enzyme in the presence of deoxynucleoside triphosphates 
under conditions which cause primer extension to take place; and 

(e) denaturing the sample under conditions such that the extended primers become separated from the target 
55 sequence. 

[0038] Preferably, the method of the present invention further includes the step (f) of repeating steps (c) to (e) on 
the denatured mixture a plurality of times. 
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[0039] Preferably, the method of the present invention is used to clone complete variable domains from Ig molecules, 
most preferably from Ig heavy chains. 

[0040] In a further aspect, the Invention provides an expression library comprising a repertoire of nucleic acid se- 
quences for expression of a repertoire of proteins each comprising an immunoglobulin variable domain. The expression 
5 library may be one which comprises a repertoire of third CDR sequences, said sequences being located in an othennnse 
invariant VH gene. 

[0041] In step (c) recited above, the forward primer becomes annealed to the sense strand of the target sequence 
at or adjacent the 3' end of the strand. In a similar manner, the back primer becomes annealed to the antisense strand 
of the target sequence at or adjacent the 3' end of the strand. Thus, the fonward primer anneals at or adjacent the 

10 region of the ds nucleic acid which encodes the C temninal end of the variable region or domain. Similarly, the back 
primer anneals at or adjacent the region of the ds nucleic acid which encodes the N-terminal end of the variable domain. 
[0042] In step (d), nucleotides are added onto the 3' end of the forward and back primers in accordance with the 
sequence of the strand to which they are annealed. Primer extension will continue in this manner until stopped by the 
beginning of the denaturing step (e). It must therefore be ensured that step (d) is carried out for a long enough time to 

15 ensure that the primers are extended so that the extended strands totally overlap one another. 

[0043] In step (e), the extended primers are separated from the ds nucleic add. The ds nucleic acid can then serve 
again as a substrate to which further primers can anneal. Moreover, the extended primers themselves have the nec- 
essary complementary sequences to enable the primers to anneal thereto. 

[0044] During further cycles, if step (f) is used, the amount of extended primers wilt increase exponentially so that 
20 at the end of the cycles there will be a large quantity of cDNA having sequences complementary to the sense and 
antisense strands of the target sequence. Thus, the method of the present invention will result in the accumulation of 
a large quantity of cDNA which can form ds cDNA encoding at least part of the variable domain. 
[0045] As will be apparent to the skilled person, some of the steps in the method may be carried out simultaneously 
or sequentially as desired. 

25 [0046] The forward and back primers may be provided as isolated oligonucleotides, In which case only two oligonu- 
cleotides will be used. However, alternatively the fonvard and back primers may each be supplied as a mixture of 
closely related oligonucleotides. For instance, it may be found that at a particular point in the sequence to which the 
primer is to anneal, there is the possibility of nucleotide variation. In this case a primer may be used for each possible 
nucleotide variation. Furthermore it may be possible to use two or more sets of "nested" primers in the method to 

30 enhance the specific cloning of variable region genes. 

[0047] The method described above is similar to the method described by Saiki et al. [1 7J. A similar method is also 
used in the methods described in EP-A-0 200 362. In both cases the method described is earned out using primers 
which are known to anneal efficiently to the specified nucleotide sequence. In neither of these disclosures was it sug- 
gested that the method could be used to clone Ig parts of variable domain encoding sequences, where the target 

35 sequence contains inherently highly variable areas. 

[0048] The ds nucleic acid sequence used in the method of the present invention may be derived Irom mRNA. For 
instance, RNA may be isolated in known manner from a cell or cell line which is known to produce Igs. mRNA may be 
separated from other RNA by oligo-dT chromatography. A complementary strand of cDNA may then be synthesised 
on the mRNA template, using reverse transcriptase and a suitable primer, to yield an RNA/DN A heteroduplex. A second 

40 strand of DNA can be made in one of several ways, for example, by priming with RNA fragments of the mRNA strand 
(made by incubating RNA/DNA heteroduplex with RNase H) and using DNA polymerase, or by priming with a synthetic 
oligodeoxynucleotide primer which anneals to the 3' end of the first strand and using DNA polymerase. It has been 
found that the method of the present invention can be carried out using ds cDNA prepared in this way 
[0049] When making such ds cDNA, it is possible to use a fonn/ard primer which anneals to a sequence in the CHI 

45 domain (for a heavy chain variable domain) or the CX, or Ck domain (for a light chain variable domain). These will be 
located in close enough proximity to the target sequence to allow the sequence to be cloned. 
[0050] The back primer may be one which anneals to a sequence at the N-termlnal end of the VH 1 , Vk or domain. 
The back primer may consist of a plurality of primers having a variety of sequences designed to be complementary to 
the various families of VH1 , Vk or VX sequences known. Alternatively the back primer may be a single primer having 

50 a consensus sequence derived from all the families of variable region genes. 

[0051] Surprisingly, it has been found that the method of the present invention can be earned out using genomic 
DNA. If genomic DNA is used, there is a very large amount of DNA present, including actual coding sequences, introns 
and untranslated sequences between genes. Thus, there is considerable scope for non-specific annealing under the 
conditions used. However, it has surprisingly been found that there is very little non-specific annealing. It is therefore 

55 unexpected that it has proved possible to clone the genes of Ig-variable domains from genomic DNA. 

[0052] Under some circumstances the use of genomic DNA may prove advantageous compared with use of mRNA, 

as the mRNA is readily degraded, and especially difficult to prepare from clinical samples of human tissue. 

[0053] Thus, in accordance with an aspect of the present invention, the ds nucleic acid used in step (a) is genomic 
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DNA. 

[0054] When using genomic DNA as the ds nucleic acid source, it will not be possible to use as the fonn/ard primer 
an oligonucleotide having a sequence complementary to a sequence in a constant domain. This is because, in genomic 
DNA, the constant domain genes are generally separated from the variable domain genes by a considerable number 

5 of base pairs. Thus, the site of annealing would be too remote from the sequence to be cloned. 

[0055] It should be noted that the method of the present invention can be used to clone both rearranged and unre- 
arranged variable domain sequences from genomic DNA. It is known that in germ line genomic DNA the three genes, 
encoding the VH, DH and JH respectively, are separated from one another by considerable numbers of base pairs. 
On maturation of the immune response, these genes are rearranged so that the VH, DH and JH genes are fused 

10 together to provide the gene encoding the whole variable domain (see Figure 1). By using a fonA/ard primer specific 
for a sequence at or adjacent the 3' end of the sense strand of the genomic "unrean^anged" VH gene, it is possible to 
clone the "unrearranged" VH gene alone, without also cloning the DH and JH genes. This can be of use in that it will 
then be possible to fuse the VH gene onto pre-cloned or synthetic DH and DH genes. In this way, rearrangement of 
the variable domain genes can be carried out in vitro. 

15 [0056] The oligonucleotide primers used In step (c) may be specifically designed for use with a particular target 
sequence. In this case, it will be necessary to sequence at least the 5' and 3' ends of the target sequence so that the 
appropriate oligonucleotides can be synthesised. However, the present inventors have discovered that it is not neces- 
sary to use such specifically designed primers. Instead, it is possible to use a species specific general primer or a 
mixture of such primers for annealing to each end of the target sequence. This is not partlculariy surprising as regards 

20 the 3' end of the target sequence. It is known that this end of the variable domain encoding sequence leads into a 
segment encoding JH which is known to be relatively conserved. However, it was surprisingly discovered that, within 
a single species, the sequence at the 5' end of the target sequence is sufficiently well conserved to enable a species 
specific general primer or a mixture thereof to be designed for the 5' end of the target sequence. 
[0057] Therefore according to a preferred aspect of the present invention, in step (c) the two primers which are used 

25 are species specific general primers, whether used as single primers or as mixtures of primers. This greatly facilitates 
the cloning of any undetermined target sequence since it will avoid the need to carry out any sequencing on the target 
sequence in order to produce target sequence-specific primers. Thus the method of this aspect of the invention provides 
a general method for cloning variable region or domain encoding sequences of a particular species. 
[0058] Once the variable domain gene has been cloned using the method described above, it may be directly inserted 

30 into an expression vector, for instance using the PGR reaction to paste the gene into a vector. 

[0059] Advantageously, however, each primer includes a sequence including a restriction enzyme recognition site. 
The sequence recognised by the restriction enzyme need not be in the part of the primer which anneals to the ds 
nucleic acid, but may be provided as an extension which does not anneal. The use of primers with restriction sites has 
the advantage that the DNA can be cut with at least one restriction enzyme which leaves 3' or 5' overhanging nude- 

35 otides. Such DNA is more readily cloned into the corresponding sites on the .vectors than blunt end fragments taken 
' . directly from the method. The ds cDNA produced at the end of the cycles will thus be readily insertable into a cloning 
vector by use of the appropriate restriction enzymes. Preferably the choice of restriction sites is such that the ds cDNA 
is cloned directly into an expression vector, such that the ligand encoded by the gene is expressed. In this case the 
restriction site is preferably located in the sequence which is annealed to the ds nucleic acid. 

40 [0060] Since the primers may not have a sequence exactly complementary to the target sequence to which it is to 
be annealed, for instance because of nucleotide variations or because of the introduction of a restriction enzyme rec- 
ognition site, it may be necessary to adjust the conditions in the annealing mixture to enable the primers to anneal to 
the ds nucleic add. This is well within the competence of the person skilled in the art and needs no further explanation. 
[0061] In step (d), any DNA polymerase may be used. Such polymerases are known in the art and are available 

45 commercially. The conditions to be used with each polymerase are well known and require no further explanation here. 
The polymerase reaction will need to be carried out in the presence of the four nucleoside triphosphates. These and 
the polymerase enzyme may already be present in the sample or may be provided afresh for each cycle. 
[0062] The denaturing step (e) may be earned out, for instance, by heating the sample, by use of chaotropic agents, 
such as urea or guanidine, or by the use of changes in ionic strength or pH. Preferably, denaturing is carried out by 

50 heating since this is readily reversible. Where heating is used to canry out the denaturing, it will be usual to use a 
thermostable DNA polymerase, such as Taq polymerase, since this will not need replenishing at each cycle. 
[0063] If heating is used to control the method, a suitable cycle of heating comprises denaturation at about 95°C for 
about 1 minute, annealing at from 30°C to 65°C for about 1 minute and primer extension at about 75°C for about 2 
minutes. To ensure that elongation and renaturation is complete, the mixture after the final cycle is preferably held at 

55 about 60°C for about 5 minutes. 

[0064] The product ds cDNA may be separated from the mixture for instance by gel electrophoresis using agarose 
gels. However, if desired, the ds cDNA may be used in un purified form and inserted directly into a suitable cloning or 
expression vector by conventional methods. This will be particularly easy to accomplish if the primers include restriction 
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enzyme recx)gnition sequences. 

[0065] The method of the present invention may be used to make variations in the sequences encoding the variable 
domains. For example this may be acheived by using a mixture of related oligonucleotide primers as at least one of 
the primers. Preferably the primers are particularly variable in the middle of the primer and relatively conserved at the 

5 5' and 3* ends. Preferably the ends of the primers are complementary to the framework regions of the variable domain, 
and the variable region in the middle of the primer covers all or part of a CDR. Preferably a fonward primer is used in 
the area which forms the third CDR. If the method is earned out using such a mixture of oligonucleotides, the product 
will be a mixture of variable domain encoding sequences. Moreover, variations in the sequence may be introduced by 
incorporating some mutagenic nucleotide triphosphates in step (d), such that point mutations are scattered throughout 

10 the target region. Alternatively such point mutations are Introduced by performing a large number of cycles of amplifi- 
cation, as errors due to the natural error rate of the DNA polymerase are amplified, particularly when using high con- 
centrations of nucleoside triphosphates. 

[0066] The method of this aspect of the present invention has the advantage that it greatly facilitates the cloning of 
variable domain encoding sequences directly from mRNAor genomic DNA. This in turn will facilitate the production of 
15 modified Ig-type molecules by any of the prior art methodes referred to above. Further, target genes can be cloned 
from tissue samples containing antibody producing cells, and the genes can be sequenced. By doing this, it will be 
possible to look directly at the immune repertoire of a patient. This "fingerprinting" of a patient's immune repertoire 
could be of use in diagnosis, for instance of auto-Immune diseases. 

[0067] In step (a) the ds cDNA is derived from mRNA. For Ig derived variable domains, the mRNA is preferably be 
20 isolated from lymphocytes which have been stimulated to enhance production of mRNA. 

[0068] In each step (c) the set of primers are preferably different from the previous step (c), so as to enhance the 
specificity of copying. Thus the sets of primers form a nested set. For example, for cloning of Ig heavy chain variable 
domains, the first set of primers may be located within the signal sequence and constant region, as described by Larrick 
et al., [18], and the second set of primers entirely within the variable region, as described by Oriandi et al., [19]. Pref- 
25 erabiy the primers of step (c) include restriction sites to facilitate subsequent cloning. In the last cycle the set of primers 
used in step (c) should preferably include restriction sites for introduction into expression vectors. Possible mismatches 
between the primers and the template strands may be corrected by "nick translation", ds cDNA is preferably cleaved 
with restriction enzymes al sites introduced Into the primers to facilitate the cloning. 

[0069] According to another aspect of the present invention the product ds cDNA is cloned directly into an expression 
30 vector. The host may be prokaryotic or eukaryotic, but is preferably bacterial. Preferably the choice of restriction sites 
in the primers and in the vector, and other features of the vector will allow the expression of corhplete ligands, while 
preserving all those features of the amino acid sequence which are typical of the (melhoded) ligands. For example, 
for expression of the rearranged variable genes, the primers would be chosen to allow the cloning of target sequences 
including at least all the three CDR sequences. The cloning vector would then encode a signal sequence (for secretion 
35 of the iigand), and sequences encoding the N-temiinal end of the first framework region, restriction sites for cloning 
and then the C-terminal end of the last (fourth) framewori< region. 

[0070] For expression of unrearranged VH genes as part of complete ligands, the primers would be chosen to allow 
the cloning of target sequences including at least the first two CDRs. The cloning vector could then encode signal 
sequence, the N-terminal end of the first framework region, restriction sites for cloning and then the C-terminal end of 
40 the third framework region, the third CDR and fourth firamework region. 

[0071] Primers and cloning vectors may likewise be devised for expression of single CDRs, particularly the third 
CDR, as parts of complete ligands. The advantage of cloning repertoires of single CDRs would penmit the design of a 
"universal" set of framework regions, incorporating desirable properties such as solubility 

[0072] Single ligands could be expressed alone or in combination with a complementary variable domain. For ex- 
45 ample, a heavy chain variable domain can be expressed either as an individual domain or, if it is expressed with a 
complementary light chain variable domain, as an antigen binding site. Preferably the two partners would be expressed 
in the same cell, or secreted from the same cell, and the proteins allowed to associate non-covalently to form an Fv 
fragment. Thus the two genes encoding the complementary partners can be placed in tandem and expressed from a 
single vector, the vector including two sets of restriction sites. 
50 [0073] Preferably the genes are introduced sequentially: for example the heavy chain variable domain can be cloned 
first and then the light chain variable domain. Alternatively the two genes are introduced into the vector in a single step, 
for example by using the polymerase chain reaction to paste together each gene with any necessary intervening se- 
quence, as essentially described by Yon and Fried [29]. The two partners could be also expressed as a linked protein 
to produce a single chain Fv fragment, using similar vectors to those described above. As a further alternative the two 
55 genes may be placed in two different vectors, for example in which one vector is a phage vector and the other is a 
plasmid vector. 

[0074] Moreover, the cloned ds cDNA may be inserted into an expression vector already containing sequences 
encoding one or more constant domains to allow the vector to express Ig-type chains. The expression of Fab fragments, 
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for example, would have the advantage over Fv fragments that the heavy and light chains would tend to associate 
through the constant domains in addition to the variable domains. The final expression product may be any of the 
modified Ig-type molecules referred to above. 

[0075] The cloned sequence may also be inserted into an expression vector so that it can be expressed as a fusion 
5 protein. The variable domain encoding sequence may be linked directly or via a linker sequence to a DNA sequence 
encoding any protein effector molecule, such as a toxin, enzyme, label or another ligand. The variable domain se- 
quences may also be linked to proteins on the outer side of bacteria or phage. Thus, the method of this aspect of the 
invention may be used to produce receptors according to the invention. 

[0076] According to another aspect of the invention, the cloning of ds cDNA directly for expression permits the rapid 
10 construction of expression libraries which can be screened for binding activities. For Ig heavy and light chain variable 
genes, the ds cDNA may comprise variable genes isolated as complete rearranged genes from the animal, or variable 
genes built from several different sources, for example a repertoire of unrearranged VH genes combined with a synthetic 
repertoire of DH and JH genes. Preferably repertoires of genes encoding Ig heavy chain variable domains are prepared 
from lymphocytes of animals immunised with an antigen. 
16 [0077] The screening method may take a range of formats well known in the art. For example Ig heavy chain variable 
domains secreted from bacteria may be screened by binding to antigen on a solid phase, and detecting the captured 
domains by antibodies. Thus the domains may be screened by growing the bacteria in liquid culture and binding to 
antigen coated on the surface of ELI S A plates. However, preferably bacterial colonies (or phage plaques) which secrete 
ligands (or modified ligands, or ligand fusions with proteins) are screened for antigen binding on membranes. Either 
20 the ligands are bound directly to the mernbranes (and for example detected with labelled antigen), or captured on 
antigen coated membranes (and detected with reagents specific for ligands). The use of membranes offers great 
convenience in screening many clones, and such techniques are well known in the art. 

[0078] The screening method may also be greatly facilitated by making protein fusions with the ligands, for example 
by introducing a peptide tag which is recognised by an antibody at the N-terminal or C-terminal end of the ligand, or 

25 joining the ligand to an enzyme which catalyses the conversion of a cotouriess substrate to a coloured product. In the 
latter case, the binding of antigen may be detected simply by adding substrate. Alternatively, for ligands expressed 
. and folded correctly inside eukaryotic cells, joining of the ligand and a domain of a transcriptional activator such as the 
GAL4 protein of yeast, and joining of antigen to the other domain of the GAL4 protein, could fonm the basis for screening 
binding activities, as described by Fields and Song [21]. 

30 [0079] The preparation of proteins, or even cells with multiple copies of the ligands, may improve the avidity of the 
ligand for immobilised antigen, and hence the sensitivity of the screening method. For example, the ligand may be 
joined to a protein subunit of a multimeric protein, to a phage coat protein or to an outer membrane protein of E. cofi 
such as ompA or lamB. Such fusions to phage or bacterial proteins also offers possibilities of selecting bacteria dis- 
playing ligands with antigen binding activities. For example such bacteria may be precipitated with antigen bound to 

35 a solid support, or may be subjected to affinity chromatography, or may be bound to larger cells or particles which have 
been coated with antigen and sorted using a fluorescence activated cell sorter (FACS). The proteins or peptides fused 
to the ligands are preferably encoded by the vector, such that cloning of the ds cDNA repertoire creates the fusion 
product. 

[0080] In addition to screening for binding activities of single ligands, it may be necessary to screen for binding or 
^0 catalytic activities of associated ligands, for example, the associated Ig heavy and light chain variable domains. For 
example, repertoires of heavy and light chain variable genes may be cloned such that two domains are expressed 
together. Only some of the pairs of domains may associate, and only some of these associated pairs may bind to 
antigen. The repertoires of heavy and light chain variable domains could be cloned such that each domain is paired 
at random. This approach may be most suitable for isolation of associated domains in which the presence of both 
45 partners is required to form a cleft. Altematively, to allow the binding of hapten. Altematively, since the repertoires of 
light chain sequences are less diverse than those of heavy chains, a small repertoire of light chain variable domains, 
for example including representative members of each family of domains, may be combined with a large repertoire of 
heavy chain variable domains. 

[0081] Preferably however, a repertoire of heavy chain variable domains is screened first for antigen binding in the 
50 absence of the light chain partner, and then only those heavy chain variable domains binding to antigen are combined 
' with the repertoire of light chain variable domains. Binding of associated heavy and light chain variable domains may 
be distinguished readily from binding of single domains, for example by fusing each domain to a different C-terminal 
peptide tag which are specifically recognised by different monoclonal antibodies. 

[0082] The hierarchical approach of first cloning heavy chain variable domains with binding activities, then cloning 
55 matching light chain variable domains may be particulariy appropriate for the construction of catalytic antibodies, as 
the heavy chain may be screened first for substrate binding. A light chain variable domain would then be identified 
which is capable of association with the heavy chain, and "catalytic" residues such as cysteine or histidine (or prosthetic 
groups) would be introduced Into the CDRs to stabilise the transition state or attack the substrate, as described by 
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Baldwin and Schultz [22]. 

[0083] Although the binding activities of non-covalently associated heavy and light chain variable domains (Fv frag- 
ments) may be screened, suitable fusion proteins may drive the association of the variable domain partners. Thus Fab 
fragments are more likely to be associated than the Fv fragments, as the heavy chain variable domain is attached to 
5 a single heavy chain constant domain, and the light chain variable domain is attached to a single light chain variable 
domain, and the two constant domains associate together. 

[0084] Alternatively the heavy and light chain variable domains are covalently linked together with a peptide, as in 
the single chain antibodies, or peptide sequences attached, preferably at the C-terminal end which will associate 
through forming cysteine bonds or through non-covalent interactions, such as the introduction of "leucine zipper" motifs. 

10 However, In order to Isolate pairs of tightly associated variable domains, the Fv fragments are preferably used. 

[0085] The construction of Fv fragments isolated from a repertoire of variable region genes offers a way of building 
complete antibodies, and an alternative to hybridoma technology. For example by attaching the variable domains to 
light or suitable heavy chain constant domains, as appropriate, and expressing the assembled genes in mammalian 
cells, complete antibodies may be made and should possess natural effector functions, such as complement lysis. 

15 This route is particulariy attractive for the construction of human monoclonal antibodies, as hybridoma technology has 
proved difficult, and for example, although human peripheral blood lymphocytes can k>e immortalised with Epstein Barr 
virus, such hybridomas tend to secrete low affinity IgM antibodies. 

[0086] Moreover, it is known that immmunological mechanisms ensure that lymphocytes do not generally secrete 
antibodies directed against host proteins. However it is desirable to make human antibodies directed against human 
20 proteins, for example to human cell surface markers to treat cancers, or to histocompatibility antigens to treat auto- 
immune diseases. The construction of human antibodies built from the combinatorial repertoire of heavy and light chain 
variable domains may overcome this problem, as it will allow human antibodies to be built with spedfidties which would 
normally have been eliminated. 

[0087] The method also offers a new way of making bispecific antibodies. Antibodies with dual specificity can be 
25 made by fusing two hybridomas of different specificities, so as to make a hybrid antibody with an Fab ami of one 
specificity, and the other Fab arm of a second specificity. However the yields of the bispecific antibody are low, as 
heavy and light chains also find the wrong partners. The construction of Fv fragments which are tightly associated 
should preferentially drive the association of the correct pairs of heavy with light chains. (It would not assist in the 
correct pairing of the two heavy chains with each other.) The improved production of bispecific antibodies would have 
30 a variety of applications in diagnosis and therapy, as is well known. 

[0088] Thus the invention provides a species specific general oligonucleotide primer or a mixture of such primers 
useful for cloning variable domain encoding sequences from animals of that species. The method allows a single pair 
or pair of mixtures of species specific general primers to be used to clone any desired antibody specificity from that 
species. This eliminates the need to carry out any sequencing of the target sequence to be cloned and the need to 
35 design spedfic primers for each specifidty to be recovered. 

[0089] Furthermore it provides for the construction of repertoires of variable genes, for the expression of the variable 
genes directly on doning, for the screening of the encoded domains for binding activities and for the assembly of the 
domains with other variable domains derived from the repertoire. 

[0090] Thus the use of the method of the present invention will allow for the production of heavy chain variable 
40 domains with binding activities and variants of these domains. It allows for the production of monoclonal antibodies 
and bispecific anfibodies, and will provide an alternafive to hybridoma technology. For instance, mouse splenic ds 
mRNA or genomic DNA may be obtained from a hyperimmunised mouse. This could be cloned using the method of 
the present Invention and then the cloned ds DNA inserted Into a suitable expression vector. The expression vector 
would be used to transform a host ceil, for instance a bacterial cell, to enable it to produce an Fv fragment or a Fab 
45 fragment. The Fv or Fab fragment would then be built into a monoclonal antibody by attaching constant domains and 
expressing It in mammalian cells. 

[0091] The present invention is now described, by way of example only, with reference to the accompanying drawings 
in which: 

50 Figure 1 shows a schematic representation of the unrearranged and rearranged heavy and light chain variable 

genes and the location of the primers; 

Figure 2 shows a schematic representation of the M13-VHPCR1 vector and a cloning scheme for amplified heavy 
chain variable domains; 

Figure 3 shows the sequence of the Ig variable region derived sequences in M13-VHPCR1; 
55 Figure 4 shows a schematic representation of the M 1 3-VKPCR 1 vector and a cloning scheme for light chain variable 

domains; 

Figure 5 shows the sequence of the Ig variable region derived sequences in M13-VKPCR1; 

Figure 6 shows the nucleotide sequences of the heavy and light chain variable domain encoding sequences of 
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MAbMBM; 

Figure 7 shows a schematic representation of the pSV-gpt vector (also known as a-Lys 30) which contains a 
variable region cloned as a Hindlll-BamHI fragment, which is excised on introducing the new variable region. The 
gene for human lgG1 has also been engineered to remove a BamHI site, such that the BamHl site in the vector 
5 is unique; 

. Figure 8 shows a schematic representation of the pSV-hygro vector (also known as a-Lys 17). It is derived from 
pSV gpt vector with the gene encoding mycophenolic acid replaced by a gene coding for hygromycin resistance. 
The construct contains a variable gene cloned as a Hindlll-BamHI fragment which is excised on introducing the 
new variable region. The gene for human Ck has also been engineered to remove a BamHI site, such that the 

10 BamHI site in the vector Is unique; 

Figure 9 shows the assembly of the mouse: human MBr1 chimaeric antibody; 
Figure 10 shows encoded amino acid sequences of 48 mouse rearranged VH genes; 
Figure 11 shows encoded amino acid sequences of human rearranged VH genes; 
Figure 1 2 shows encoded amino acid sequences of unrearranged human VH genes; 

15 Figure 13 shows the sequence of part of the plasmid pSW1: essentially the sequence of a pectate lyase leader 

linked to VHLYS in pSWI and cloned as an Sphl-EcoRI fragment into pUC19 and the translation of the open 
reading frame encoding the pectate lyase leader-VHLYS polypeptide being shown; 

Figure 14 shows the sequence of part of the plasmid pSW2: essentially the sequence of a pectate lyase leader 
linked to VHLYS and to VKLYS, and cloned as an Sphl-EcoRI-EcoRI fragment into pUC19 and the translation of 
20 open reading frames encoding the pectate lyase leader-VHLYS and pectate lyase leader-VKLYS polypeptides 

being shown; 

Figure 15 shows the sequence of part of the plasmid pSWIHPOLYMYC which is based on pSWI and in which a 
polylinker sequence has replaced the variable domain of VHLYS, and acts as a cloning site for amplified VH genes, 
, and a peptide tag is introduced at the C-terminal end; 
25 Figure 16 shows the encoded amino acid sequences of two VH domains derived from mouse spleen and having 

lysozyme binding activity, and compared with the VH domain of the D1 ,3 antibody. The arrows mark the points of 
difference between the two VH domains; 

Figure 17 shows the encoded amino acid sequence of a VH domain derived from human peripheral blood lym- 
phocytes and having lysozyme binding activity; 
30 Figure 18 shows a scheme for generating and cloning mutants of the VHLYS gene, which is compared with the 

scheme for cloning natural repertoires of VH genes; 
Figure 19 shows the sequence of part of the vector pSW2HPOLY; 

Figure 20 shows the sequence of part of the vector pSW3 which encodes the two linked VHLYS domains; 
Figure 21 shows the sequence of the VHLYS domain and pelB leader sequence fused to the alkaline phosphatase 
35 gene; 

Figure 22 shows the sequence of the vector pSWI VHLYSVKPOLYMYC for expression of a repertoire of Vk light 
chain variable domains in association with the VHLYS domain; and 

Figure 23 shows the sequence of VH domain which is secreted at high levels from E. colL The differences with 
VHLYS domain are marked. 
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PRIMERS 

[0092] In the Examples described below, the following oligonucleotide primers, or mixed primers were used. Their 
locations are marked on Figure 1 and sequences are as follows: 


VHIFOR 5 ' TGAGGAGACGGTGACCGTGGTCCCTTGGCCCCAG 3 • ; 


HulVHFOR 5 


HU2VHF0R 5' ^* *ww*wwc,««w*wowwc*w«www*w«ww ^ , 

HU3VHF0R 5' CTTGGTGGATGCTGAGGAGACGGTGACC 3»; 
HU4VHF0R 5 • CTTGGTGGATGCTGATGAGACGGTGACC 3 • ; 
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MOJHIFOR 5' TGAGGAGACGGTGACCGTGGTCCCTGCGCCCCAG 3 

)(0jH2F0R 5' TGAGGAGACGGTGACCGTGGTGCCTTGGCCCCAG 3i 

M0JH3F0R 5' TGCAGAGACGGTGACCAGAGTCCCTTGGCCCCAG 2 

M0JH4F0R S' TGAGGAGACGGTGACCGAGGTTCCTTGACCCCA6 3 


HUJHIFOR 5' TGAGGAGACG6TGACCAG6GTGCCCTG6CCCCAG 3 
HUJH2FOR 5* TGA6GAGACGGTGACCAGGGTGCCACGGCCCCAG 3 
HUJH4FOR 5* TGAGGAGACGGTGACCAGGGTTCCTTGGCCCCAG 3 


VKIFOR 5 • GTTAGATCTCCAGCTTGGTCCC 3 ' ; 
VK2F0R 5 • CGTTAGATCTCCAGCTTGGTCCC 3 • ; 
VK3F0R 5» CC6TTTCAGCTCGAGCTTGGTCCC 3'; 


MOJKIFOR 5« CGTTAGATCTCCAGCTTGGTGCC 3'; 
MOJK3FOR 5' GGTTAGATCTCCAGTCTGGTCCC 3'; 
M0JK4FOR 5' CGTTAGATCTCCAACTTTGTCCC 3'; 


HUJKIFOR 5« C6TTAGATCTCCACCTTGGTCCC 3'; 
HUJK3FOR 5' CGTTAGATCTCCACTTTGGTCCC 3'; 
HUJK4F0R 5 • CGTTAGATCTCCACCTTGGTCCC 3 • ; 
HUJK5F0R 5' CGTTAGATCTCCAGTCGTGTCCC 3'; 

VHIBACK 5« AGGT(C/G) (C/A)A{G/A)CTGCAG(G/C)AGTC(T/A)GG 


HU2VHIBACK: 5* CAGGTGCAGCTGCAGCAGTCTGG 3'; 
HUVHIIBACK: 5' CAGGTGCAGCTGCAGGAGTCGGG 3'; 
HU2VHIIIBACK: 5« GAGGTGCAGCTGCAGGAGTCTGG 3'; 
HuVHIVBACK: 5' CAGGTGCAGCTGCAGCAGTCTGG 3*; 
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MOVHIBACK 5» AGGTGCAGCTGCAGGAGTCAG 3*; 

MOVHIIABACK 5* AGGTCCAGCTGCAGCA(G/A)TCTGG 3»; 

MOVHIIBBACK 5» AGGTCCAACTGCAGCAGCCTGG 3«; 

MOVHIIBACK 5« AGGTGAAGCTGCAGGAGTCTGG 3»; 


VKIBACK 


VK2BACK 5' GACATTGAGCTCACCCAGTCTCCA 3»; 

MOVKIIABACK 5' GATGTTCAGCTGACCCAAACTCCA 3' 
MOVKIIBBACK 5 » GATATTCAGCTGACCCAGGATGAA 3 ' } 


HuHeplFOR 
HuOctalBACK 
HuOcta2BACK 
HuOcta3BACK 


5 • C ( A/G) ( C/G) TGAGCTCACTGTGTCTCTCGCACA 3 • ; 


5» 
5* 


CGTGAATATGCAAATAA 3»; 
AGTAGGAGACATGCAAAT 3'; 
CACCACCCACATGCAAAT 3 • ; 


and 


VHMUTl 


GGAGACGGTGACCGTGGTCCCTTGGCCCCAGTAGTCAAG 
NNNNNNNNNNNNCTCTCTGGC 3' (where N is an 
equimolar mixture of T, C, G and A) 


M13 pRIMER 5« AACAGCTATGACCATG 3« (New England Biolabs 

*1201) 

EXAMPLE 1 

Cloning of Mouse Rearranged Variable region genes from hybridomas, assembly of genes encoding chimaeric 
antibodies and the expression of antibodies from myeloma cells 

[0093] VH 1 FOR is designed to anneal with the 3' end of the sense strand of any mouse heavy chain variable domain 
encoding sequence. It contains a BstEII recognition site. VK1FOR is designed to anneal with the 3' end of the sense 
strand of any mouse kappa-type light chain variable domain encoding sequence and contains a Bglll recognition site. 
VHIBACK is designed to anneal with the 3' end of the antisense strand of any mouse heavy chain variable domain 
and contains a PstI recognition site. VK1BACK is designed to anneal with the 3' end of the antisense strand of any 
mouse kappa-type light chain variable domain encoding sequence and contains a Pvull recognition site. 
[0094] In this Example five mouse hybridomas were used as a source of ds nucleic add. The hybridomas produce 
monoclonal antibodies (MAbs) designated MBr1 [23], BW431/26 [24], BW494/32 [25], BW250/183 [24,26] and 
BW704/152 [27]. MAb MBr1 is particulariy interesting in that it is known to be specific for a saccharide epitope on a 
human mammary carcinoma line MCF-7 [28]. 

Cloning via mRNA 

[0095] Each of the five hybridomas referred to above was grown up in roller bottles and about 5 x 10^ celts of each 
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hybrldoma were used to isolate RNA. mRNA was separated from the isolated RNA using oligodT cellulose [29]. First 
strand cDNA was synthesised according to the procedure described by Maniatis et al. [30] as set out below. 
[0096] In order to clone the heavy chain variable domain encoding sequence, a 50 ^il reaction solution which contains 
1 0 ^g mRNA, 20 pmole VH 1 FOR primer, 250 ^iM each of d ATP, dTTP, dCTP and dGTP, 1 0 mM dithiothreitol (DTT), 
5 100 mM Tris.HCI, 10 mM MgCl2 and 140 mM KCI, adjusted to pH 8.3 was prepared. The reaction solution was heated 
at 70 °C for ten minutes and allowed to cool to anneal the primer to the 3' end of the variable domain encoding sequence 
in the mRNA. To the reaction solution was then added 46 units of reverse transcriptase (Anglian Biotec) and the solution 
was then incubated at 42°C for 1 hour to cause first strand cDNA synthesis. 

[0097] In order to clone the light chain variable domain encoding sequence, the same procedure as set out above 
10 was used except that the VK1 FOR primer was used in place of the VH 1 FOR primer. 

Amplification from RNA/DNA hybrid 

[0098] Once the ds RNA/DNA hybrids had been produced, the variable domain encoding sequences were amplified 
15 as follows. For heavy chain variable domain encoding sequence amplification, a 50 \i\ reaction solution containing 5 
ji,l of the ds RNA/DNA hybrid-containing solution, 25 pmole each of VH1 FOR and VH1 BACK primers, 250 ^iM of dATP, 
dTTP, dCTP and dGTP, 67 mM Tris.HCI, 17 mM ammonium sulphate, 10 mM MgCl2, 200 |xg/ml gelatine and 2 units 
Taq polymerase (Cetus) was prepared. The reaction solution was overiaid with paraffin oil and subjected to 25 rounds 
of temperature cycling using a Techne PHC-1 programmable heating block. Each cycle consisted of 1 minute and 95° C 
20 (to denature the nucleic adds), 1 minute at 30°C (to anneal the primers to the nucleic acids) and 2 minutes at 72''C 
(to cause elongation firom the primers). After the 25 cycles, the reaction solution and the oil were extracted twice with 
ether, once with phenol and once with phenol/CHCI3. Thereafter ds cDNA was precipitated with ethanol. The precip- 
itated ds cDNA was then taken up in 50 jnl of water and frozen. 

[0099] The procedure for light chain amplification was exactly as described above, except that the VK1F0R and 
25 VK1 BACK primers were used in place of the VH 1 FOR and VH 1 BACK primers respectively. 

[0100] 5 ^1 of each sample of amplified cDNA was fractionated on 2% agarose gets by electrophoresis and stained 
with ethidium bromide. This showed that the amplified ds cDNA gave a major band of the expected size (about 330 
bp). (However the band for VK DNA of MBrI was very weak. It was therefore excised from the gel and reamplified in 
a second round.) Thus by this simple procedure, reasonable quantities of ds DNA encoding the light and heavy chain 
30 variable domains of the five MAbs were produced. 

Heavy Chain Vector Construction 

[0101] A BstEII recognition site was introduced into the vector M1 3-HuVHNP [31] by site directed mutagenesis [32,33] 

35 to produce the vector M13-VHPCR1 (Figures 2 and 3). 

[0102] Each amplified heavy chain variable domain encoding sequence was digested with the restriction enzymes 
PstI and BstEII. The fragments were phenol extracted, purified on 2% low melting point agarose gels and force cloned 
into vector M13-VHPCR1 which had been digested with PstI and BstEII and purified on an 0.8% agarose gel. Clones 
containing the variable domain inserts were identified directly by sequencing [34] using primers based in the 3' non- 

40 coding variable gene in the Ml 3-VHPCR1 vector. 

[0103] There is an internal PstI site in the heavy chain variable domain encoding sequences of BW431/26. This 
variable domain encoding sequence was therefore assembled in two steps. The 3' Pstl-BstEII fragment was first cloned 
into M13-VHPCR1 , followed in a second step by the 5' PstI fragment. 

45 Light Chain Vector Construction 

[0104] Vector Ml 3mp 18 [35] was cut with Pvull and the vector backbone was blunt ligated to a synthetic Hindlll-Bam- 
Hl polylinker. Vector M13-HuVKLYS [36] was digested with Hindlll and BamHI to isolate the HuVKLYS gene. This 
Hindlll-BamHI fragment was then inserted into the Hindljl-BamHI polylinker site to form a vector M13-VKPCR1 which 
50 lacks any Pvull sites in the vector backbone (Figures 4 and 5). This vector was prepared in E Coli JM110 [22] to avoid 
dam methylation at the Bell site. 

[0105] Each amplified light chain variable domain encoding sequence was digested with Pvull and Bglll. The frag- 
ments were phenol extracted, purified on 2% low melfing point agarose gels and force cloned into vector Ml 3-VKPCR1 
which had been digested with Pvull and Bell, purified on an 0.8% agarose gel and treated with calf intestinal phos- 
55 phatase. Clones containing the light chain variable region inserts were identified directly by sequencing [34] using 
• primers based in the 3' non-coding region of the variable domain in the Ml 3-VKPCR1 vector. 
[0108] The nucleotide sequences of the MBrI heavy and light chain variable domains are shown in Figure 6 with 
part of the flanking regions of the M13-VHPCR1 and M13-VKPCR1 vectors. 
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Antibody Expression 

[0107] The Hindlll-BamHI fragment carrying the MBr1 heavy chain variable domain encoding sequence in 
M13-VHPCR1 was recloned into a pSV-gpt vector with human yi constant regions [37] (Figure 7). The MBr1 light chain 
5 variable domain encoding sequence in M13-VKPCR1 was recloned as a Hindill-BamHI fragment into a pSV vector, 
PSV-hyg-HuCK with a hygromycin resistance marker and a human kappa constant domain (Figure 8). The assembly 
of the genes is summarised in Figure 9. 

[0108] The vectors thus produced were linearised with Pvul (in the case of the pSV-hygro vectors the Pvul digest is 
only partial) and cotransfected into the non-secreting mouse myeloma line NSO [38] by electroporation [39]. One day 

10 after cotransfection, cells were selected in 0.3 p.g/ml mycophenolic acid (MPA) and after seven days in 1 jag/ml MPA. 
After 14 days, four wells, each containing one or two major colonies, were screened by incorporation of ^^C-lysine [40] 
and the secreted antibody detected after precipitation with protein-A Sepharose^w (Phamnada) on SDS-PAGE [41], 
The gels were stained, fixed, soaked in a fluorographic reagent, Amplify'^'^ (Amersham), dried and autoradlographed 
on preflashed film at -70°C for 2 days. 

15 [0109] Supernatant was also tested for binding to the mammary carcinoma line MCF-7 and the colon carcinoma line 
HT-29, essentially as described by Menard et al. [23], either by an indirect immunoflorescence assay on celt suspen- 
sions (using a fluorescein-labelled goat anti-human IgG (Amersham)) or by a solid phase RIA on monolayers of fixed 
cells (using ''25|-protein A (Amersham)). 

[0110] It was found that one of the supernatants from the four wells contained secreted antibody. The chimeric ah- 
20 tibody in the supematant, like the parent mouse MBrI antibody, was found to bind to MCF-7 cells but not the HT-29 
cells, thus showing that the specificity had been property cloned and expressed. 

Example 2 

25 Cloning of rearranged variable genes from genomic DNA of mouse spleen 
Preparation of DN A from spleen. 

[0111] The DNA from the mouse spleen was prepared in one of two ways (although other ways can be used). 

30 [01 12] Method 1 . A mouse spleen was cut into two pieces and each piece was put into a standard Eppendorf tube 
with 200 |il of PBS. The tip of a 1 ml glass pipette was closed and rounded in the blue flame of a Bunsen burner. The 
pipette was used to squash the spleen piece in each tube. The cells thus produced were transferred to a fresh Eppendorf 
tube and the method was repeated three times until the connective tissue of the spleen appeared white. Any connective 
tissue which has been transferred with the cells was removed using a drawn-out Pasteur pipette. The cells were then 

35 washed in PBS and distributed into four tubes. 

[0113] The mouse spleen cells were then sedimented by a 2 minute spin In a Microcentaur centrifuge at low speed 
setting. All the supernatant was aspirated with a drawn out Pasteur pipette. If desired, at this point the cell sample can 
be frozen and stored at -20 °C 

[0114] To the cell sample (once thawed if it had been frozen) was added 500 ^1 of water and 5 |iil of a 10% solution 
40 of NP-40. a non-ionic detergent. The tube was closed and a hole was punched in the lid. The tube was placed on a 
boiling water bath for 5 minutes to disrupt the cells and was then cooled on ice for 5 minutes. The tube was then spun 
for 2 minutes at high speed to remove cell debris. 

[0115] The supernatant was transferred to a new tube and to this was added 125 ^il 5M NaCI and 30 1 M MOPS 
adjusted to pH 7.0. The DNA in the supernatant was absorbed on a Quiagen 5 tip and purified following the manufac- 

45 turer's instructions for lambda DNA. After isopropanol precipitation, the DNA was resuspended in 500 ^1 water. 

[01 16] Method 2. This method is based on the technique described in Maniatis et al. [30]. A mouse spleen was cut 
into very fine pieces and put into a 2 ml glass homogeniser. The cells were then freed from the tissue by several slow 
up and down strokes with the piston. The cell suspension was made in 500 phosphate buffered saline (PBS) and 
transferred to an Eppendorf tube. The cells were then spun for 2 min at low speed in a Microcentaur centrifuge. This 

50 results in a visible separation of white and red cells. The white cells, sedimenting slower, form a layer on top of the red 
cells. The supernatant was carefully removed and spun to ensure that all the white cells had sedimented. The layer of 
white cells was resuspended in two portions of 500 \i\ PBS and transferred to another tube. 
[01 17] The white cells were precipitated by spinning in the Microcentaur centrifuge at low speed for one minute. The 
cells were washed a further two times with 500 ^1 PBS, and were finally resuspended in 200 |xJ PBS. The white cells 

55 were added to 2.5 ml 25 mM EDTA and 10 mM Tris.CI, pH 7.4, and vortexed slowly. While vortexing 25 20% SDS 
was added. The cells lysed immediately and the solution became viscous and clear. 100 ^ of 20 mg/ml proteinase K 
was added and incubated one to three hours at 50 °C. 

[0118] The sample was extracted with an equal volume of phenol and the same volume of chloroform, and vortexed. 
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After centrifuging, the aqueous phase was removed and 1/10 volume 3M ammonium acetate was added. This was 
overlaid with three volumes of cold ethanol and the tube rocked carefully until the DNA strands became visible. The 
DNA was spooled out with a Pasteur pipette, the ethanol allowed to drip off, and the DNA transferred to 1 ml of 10 mM 
Tris.CI pH 7.4, 0.1 mM EDTA in an Eppendorf tube. The DNA was allowed to dissolve in the cold overnight on a roller. 

5 

Amplification from genomic DNA. 

[0119] The DNA solution was diluted 1/10 in water and boiled for 5 min prior to using the polymerase chain reaction 
(PGR). For each PGR reaction, typically 50-200 ng of DNA were used. 

10 [0120] The heavy and light chain variable domain encoding sequences in the genomic DNA isolated from the human 
PBL or the mouse spleen cells was then amplified and cloned using the general protocol described in the first two 
paragraphs of the section headed "Amplification from RNA/DNA Hybrid" in Example 1 , except that during the annealing 
part of each cycle, the temperature was held at 65°G and that 30 cycles were used. Furthermore, to minimise the 
annealing between the 3' ends of the two primers, the sample was first heated to 95°C, then annealed at 65°C, and 

15 only then was the Taq polymerase added. At the end of the 30 cycles, the reaction mixture was held at 60°C for five 
minutes to ensure that complete elongation and renaturation of the amplified fragments had taken place. 
[0121] The primers used to amplify the mouse spleen genomic DNA were VH1F0R and VH1BACK. for the heavy 
chain variable domain and VK2F0R and VK1 BAGK, for the light chain variable domain. (VK2FOR only differs from 
VK1F0R in that it has an extra G residue on the 5* end.) 

20 [0122] Other sets of primers, designed to optimise annealing with different families of mouse VH and Vx genes were 
devised and used in mixtures with the primers above. For example, mixtures of VK1F0R, M0JK1F0R, M0JK3F0R 
and M0JK4F0R were used as forward primers and mixtures of VK1BACK, MOVKIIABACK and MOVKIIBBACK as 
back primers for amplification of Vx genes. Likewise mixtures of VH IFOR, M0JH1F0R, M0JH2F0R, MOJH3FOR 
and M0JH4F0R were used as fonA^ard primers and mixtures of VH1BACK, MOVHIBACK, MOVHIIABACK, MOVHI- 

25 IBBACK, MOVHIIIBACK were used as backward primers for amplification of VH genes. 

[0123] All these heavy chain FOR primers referred to above contain a BstEII site and all the BACK primers referred 
to above contain a PstI site. These light chain FOR and BAGK primers referred to above all contain Bglll and Pvull 
sites respectively. Light chain primers (VK3FOR and VK2BACK) were also devised which utilised different restriction 
sites, Sad and Xhol. 

30 [0124] Typically all these primers yielded amplified DNA of the connect size on gel electrophoresis, although other 
bands were also present. However, a problem was identified in which the 5' and 3' ends of the forward and backward 
primers for the VH genes were partially complementary, and this could yield a major band of "primer-dimer" in which 
the two oligonucleotides prime on each other. For this reason an improved forward primer, VH1FOR-2 was devised in 
which the two 3* nucleotides were removed from VH 1 FOR. 

35 [0125] Thus, the prefen-ed amplification conditions for mouse VH genes are as follows: the sample was made in a 
volume of 50-100 50-100 ng of DNA, VH1 FOR-2 and VH1 BACK primers (25 pmole of each), 250 |j.M of each 
deoxynucleotide triphosphate, 10 mM Tris.HCI, pH 8.8, 50 mM KCI, 1.5 mM MgCl2, and lOOp-g/ml gelatine. The sample 
was overlaid with paraffin oil, heated to 95° C for 2 min, 65° C for 2 min, and then to 72°C: taq polymerase was added 
after the sample had reached the elongation temperature and the reaction continued for 2 min at 72° C. The sample 

40 was subjected to a further 29 rounds of temperature cycling using the Techne PHC-1 programmable heating block. 
[0126] The preferred amplification conditions for mouse Vk genes from genomic DNA are as follows: the sample 
treated as above except with Vk primers, for example VK3F0R and VK2BAGK, and using a cycle of 94° C for one 
minute, 60° C for one minute and 72° C for one minute. 

[01 27] The conditions which were devised for genomic DNA are also suitable for amplification from the cDNA derived 
45 from mRNA from mouse spleen or mouse hybridoma. 

Cloning and analysis of variable region genes 

[0128] The reaction mixture was then extracted twice with 40 ^1 of water-saturated diethyl ether. This was followed 
50 by a standard phenol extraction and ethanol precipitation as described in Example 1 . The DNA pellet was then dissolved 
in 100 ^il 10 mM Tris.CI, 0.1 mM EDTA. 

[0129] Each reaction mixture containing a light chain variable domain encoding sequence was digested with Sad 
and Xhol (or with Pvull and Bglll) to enable it to be ligated into a suitable expression vector. Each reaction mixture 
containing a heavy chain variable domain encoding sequence was digested with Psti and BstEII for the same purpose. 
55 [01 30] The heavy chain variable genes isolated as above from a mouse hyperimmunised with lysozyme were cloned 
into M13VHPCR1 vector and sequenced. The complete sequences of 48 VH gene clones were determined (Figure 
10). All but two of the mouse VH gene families were represented, with frequencies of: VA (1), INC (1 ), IIIB (8), IIIA (3), 
MB (17), IIA (2), IB (12), lA (4). In 30 clones, the D segments could be assigned to families SP2 (14), FL16 (11) and 
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Q52 (5), and in 38 clones the JH minigenes to families JH1 (3), JH2 (7), JH3 (14) and JH4 (14). The different sequences 
of CDR3 marked out each of the 48 clones as unique. Nine pseudogenes and 16 unproductive rearrangements were 

identified. Of the clones sequenced, 27 have open reading frames. 

[0131] Thus the method is capable of generating a diverse repertoire of heavy chain variable genes from mouse 
5 spleen DNA. 

Example 3 

Cloning of rearranged variable genes from mRNA from human perioheral blood lymphocytes 

10 

Preparation of mRNA. 

[0132] Human peripheral blood lymphocytes were purified and mRNA prepared directly (Method 1 ), or mRNA was 
prepared after addition of Epstein Barr virus (Method 2). 

IS [0133] Method 1 . 20 ml of heparinised human blood from a healthy volunteer was diluted with an equal volume of 
phosphate buffered saline (PBS) and distributed equally into 50 ml Falcon tubes. The blood was then underlayed with 
15ml Ficoll Hypaque (Phamnacia 1 0-A-001 -07). To separate the lymphocytes from the red blood cells, the tubes were 
spun for 10 minutes at 1800 rpm at room temperature in an I EC Centra 3E table centrifuge. The peripheral blood 
lymphocytes (PBL) were then collected from the interphase by aspiration with a Pasteur pipette. The cells were diluted 

20 with an equal volume of PBS and spun again at 1500 rpm for 15 minutes. The supernatant was aspirated, the cell 
pellet was resuspended in 1 ml PBS and the cells were distributed into two Eppendorf tubes. 
[0134] Method 2. 40 ml human blood from a patient with HIV in the pre-AIDS condition was layered on Ficoll to 
separate the white cells (see Method 1 above). The white cells were then incubated in tissue culture medium for 4-5 
days. On day 3, they were infected with Epstein Barr virus. The cells were pelleted (approx 2x10^ cells) and washed 

25 in PBS. 

[0135] The cells were pelleted again and lysed with 7 ml 5M guanidine tsothiocyanate, 50 mM Tris, 10 mM EDTA, 
0.1 mM dithiothreitol. The cells were vortexed vigorously and 7 volumes of 4M LiCI added. The mixture was incubated 
at 4°C for 15-20 hrs. The suspension was spun and the supematant resuspended in 3M LiCI and centrifuged again. 
The pellet was dissolved in 2ml 0.1 % SDS, 10 mM Tris HCI and 1 mM EDTA. The suspension was frozen at -20°C, 

30 and thawed by vortexing for 20 s every 1 0 min for 45 min . A large white pellet was left behind and the clear supematant 
was extracted with phenol chloroform, then with chloroform. The RNA was precipitated by adding 1/10 volume 3M 
sodium acetate and 2 vol ethanol and leaving overnight at -20°C. The pellet was suspended in 0.2 ml water and 
reprecipitated with ethanol. Aliquots for cDNA synthesis were taken from the ethanol precipitate which had been vor- 
texed to create a fine suspension. 

35 , [0136] 100 |xl of the suspension was precipitated and dissolved in 20 ^,1 waterforcDNA synthesis [30] using 10 pmole 
of a HUFOR primer (see below) in final volume of 50 jxl. A sample of 5 |xl of the cDNA was amplified as in Example 2 
except using the primers for the human VH gene families (see below) using a cycle of 95°C, 60°C and 72°C. 
[0137] The back primers for the amplification of human DNA were designed to match the available human heavy 
and light chain sequences, in which the different families have slightly different nucleotide sequences at the 5' end. 

40 Thus for the human VH genes, the primers Hu2VHIBACK, HuVHIIBACK, Hu2VHIIIBACK and HuVHIVBACK were 
designed as back primers, and HUJH1F0R, HUJH2FOR and HUJH4F0R as fonvard primers based entirely in the 
variable gene. Another set of fonward primers Hul VHFOR, Hu2VHF0R, Hu3VHF0R, and Hu4VHF0R was also used, 
which were designed to match the human J-regions and the 5' end of the constant regions of different human isotopes. 
[01 38] Using sets of these primers it was possible to demonstrate a band of amplified ds cDNA by gel electrophoresis. 

45 [0139] One such experiment was analysed in detail to establish whether there was a diverse repertoire in a patient 
with HIV infection. It is known that during the course of AIDS, that T-cells and also antibodies are greatly diminished 
in the blood. Presumably the repertoire of lymphocytes is also diminished. In this experiment, for the fonward priming, 
an equimolar mixture of primers Hul VHFOR, Hu2VHFOR, Hu3VHF0R, and Hu4VHF0R (in PCR 25 pmole of primer 
5' ends) was used. For the back priming, the primers Hu2VHIBACK, HuVHIIBACK, Hu2VHIIIBACK and HuVHI VBACK 

50 were used separately in four separate primings. The amplified DNA from the separate primings was then pooled, 
digested with restriction enzymes PstI and BstEII as above, and then cloned into the vector M13VHPCR1 for sequenc- 
ing. The sequences reveal a diverse repertoire (Fig. 11) at this stage of the disease. 

[0140] For human Vxgenes the primers HuJKI FOR, HUJK3FOR,HUJK4FORandHUJK5FOR were used as fon/vard 
primers and VK1 BACK as back primer. Using these primers it was possible to see a band of amplified ds cDNA of the 
55 correct size by gel electrophoresis. 
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Example 4 

Cloning of unrearranged variable gene genomic DNA from human peripheral blood lymphocytes 

5 [0141] Human peripheral blood lymphocytes of a patient with non-Hodgkins lymphoma were prepared as in Example 
3 (Method 1). The genomic DNA was prepared from the PBL using the technique described in Example 2 (Method 2). 
The VH region in the isolated genomic DNA was then amplified and cloned using the general protocol described in the 
first two paragraphs of the section headed "Amplification from RNA/DNA hybrid" in Example 1 above, exceptthat during 
the annealing part of each cycle, the temperature was held at S5°C and that 30 cycles were used. At the end of the 

10 30 cycles, the reaction mixture was held at 60°C for five minutes to ensure that complete elongation and renaturation 
of the amplified fragments had taken place. 

[0142] The fonward primer used was HuHepI FOR, which contains a Sad site. This primer is designed to anneal to 
the 3' end of the unrearranged human VH region gene, and In particular includes a sequence complementary to the 
last three codons in the VH region gene and nine nucleotides downstream of these three codons. 

15 [0143] As the back primer, an equimolar mixture of HuOctalBACK, HuOcta2BACK and HuOcta3BACK was used. 
These primers anneal to a sequence in the promoter region of the genomic DNA VH gene (see Figure 1 ). 5^1 of the 
amplified DNA was checked on 2% agarose gels in TBE buffer and stained with ethidium bromide. A double band was 
seen of about 620 nucleotides which corresponds to the size expected for the unrearranged VH gene. The ds cDNA 
was digested with Sad and cloned into an M13 vector for sequencing. Although there are some sequences which are 

20 identical, a range of different unrearranged human VH genes were identified (Figure 12). 

Example 5 

Cloning Variable Domains with Binding Activities from a Hybridoma 

[0144] The heavy chain variable domain (VHLYS) of the D1.3 (anti-lysozyme) antibody was cloned into a vector 
similar to that described previously [42] but under the control of the lac z promoter, such that the VHLYS domain is 
attached to a pelB leader sequence for export into the periplasm. The vector was constructed by synthesis of the pelB 
leader sequence [43], using overlapping oligonucleotides, and cloning into a pUC 19 vector [35]. The VHLYS domain 
30 of the D1.3 antibody was derived from a cDNA clone [44] and the construct (pSW1 ) sequenced (Figure 13). 

[0145] To express both heavy and light chain variable domains together, the light chain variable region (VKLYS) of 
the D1 .3 antibody was introduced into the pSWI vector, with a pelB signal sequence to give the construct pSW2 (Figure 
14). 

[0146] A strain of E. coli (BMH71-18) [45] was then transformed [46,47] with the plasmid pSWI or pSW2, and colonies 
35 resistant to ampidllin (100 ng/ml) were selected on a rich (2 x TY = per litre of water, 16g Bactotryptone, lOg yeast 
extract, 5g NaCI) plate which contained 1% glucose to repress the expression of variable domain(s) by catabolite 

repression. 

[0147] The colonies were inoculated into 50 ml 2 x TY (with 1 % glucose and 1 00 jig/ml amplcillln) and grown in flasks 
at 37° C with shaking for 12-16 hr. The cells were centrifuged, the pellet washed twice with 50 mM sodium chloride, 

40 resuspended in 2 x TY medium containing 100 |uig/ml ampicillin and the inducer IPTG (1 mM) and grown for a further 
30 hrs at 37°C. The cells were centrifuged and the supernatant was passed through a Nalgene filter (0.45 ^m) and 
then down a 1 - 5 ml lysozyme-Sepharose affinity column. (The column was derived by coupling lysozyme at 10 mg/ 
ml to CNBr activated Sepharose.) The column was first washed with phosphate buffered saline (PBS), then with 50 
mM diethylamine to elute the VHLYS domain (from pSWI) or VHLYS in association with VKLYS (from pSW2). 

45 [0148] The VHLYS and VKLYS domains were identified by SDS polyacrylamide electrophoresis as the correct size. 
In addition, N-terminal sequence determination of VHLYS and VKLYS isolated from a polyacrylamide gel showed that 
the signal peptide had been produced correctly. Thus both the Fv fragment and the VHLYS domains are able to bind 
to the lysozyme affinity column, suggesting that both retain at least some of the affinity of the original antibody. 
[0149] The size of the VHLYS domain was compared by FPLC with that of the Fv fragment on Superose 12. This 

50 indicates that the VHLYS domain is a monomer. The binding of the VHLYS and Fv fragment to lysozyme was checked 
by ELISA, and equilibrium and rapid reaction studies were carried out using fluorescence quench. 
[0150] The ELISA for lysozyme binding was undertaken as follows: 

(1) The plates (Dynatech Immulon) were coated with 200 ^1 per well of 300 p^g/ml lysozyme in 50 mM NaHCOs, 
55 pH 9.6 overnight ar room temperature; 

(2) The wells were rinsed with three washes of PBS, and blocked with 300 ^1 per well of 1% Sainsbur/s instant 
dried skimmed milk powder in PBS for 2 hours at 37°C; 

(3) The wells were rinsed with three washes of PBS and 200 ^il of VHLYS or Fv fragment (VHLYS assodated with 
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VKLYS) were added and incubated for 2 hours at room temperature; 

(4) The wells were washed three times with 0.05% Tween 20 in PBS and then three times with PBS to remove 

detergent; 

(5) 200 III of a suitable dilution (1 :1000) of rabbit polyclonal antisera raised against the FV fragment in 2% skimmed 
5 milk powder in PBS was added to each well and incubated at room temperature for 2 hours; 

(6) Washes were repeated as in (4); 

(7) 200 \l\ of a suitable dilution (1:1000) of goat anti-rabbit antibody (ICN Immunochemicals) coupled to horse 
radish peroxidase, in 2% skimmed milk powder in PBS, was added to each well and Incubated at room temperature 
for 1 hour; 

10 (8) Washes were repeated as in (4); and 

(9) 200 \i\ 2,2'azino-bis(3-ethylbenzthiazolinesulphonicacid) [Sigma] (0.55 mg/ml, with 1 \i\ 20% hydrogen perox- 
ide: water per 10 ml) was added to each welt and the colour allowed to develop for up to 10 minutes at room 
temperature. 

15 [0151] The reaction was stopped by adding 0.05% sodium azide in 50 mM dtric acid pH 4.3. ELISA plates were read 
in a Tltertek Multiscan plate reader. Supernatant from the induced bacterial cultures of both pSWI (VHLYS domain) 
or pSW2 (Fv fragment) was found to bind to lysozyme in the ELISA. 

[0152] The purified VHLYS and Fv fragments were titrated with lysozyme using fluorescence quench (Perkin Elmer 
LS5B Luminescence Spectrometer) to measure the stolchiometry of binding and the affinity constant for lysozyme 
20 [48.49]. The titration of the Fv fragment at a concentration of 30 nM indicates a dissociation constant of 2.8 nM using 
a Scatchard analysis. 

[0153] A similar analysis using fluorescence quench and a Scatchard plot was carried out for VHLYS, at a VHLYS 
concentration of 100 nM. The stoichiometry of antigen binding is about 1 mole of lysozyme per mole of VHLYS (cal- 
culated from plot). (The concentration of VH domains was calculated from optical density at 280 nM using the typical 

25 extinction coefficient for complete immunoglobulins.) Due to possible errors in measuring low optical densities and the 
assumption about the extinction coefficient, the stoichiometry was also measured more carefully. VHLYS was titrated 
with lysozyme as above using fluorescence quench. To detennine the concentration of VHLYS a sample of the stock 
solution was removed, a known amount of norleucine added, and the sample subjected to quantitative amino acid 
analysis. This showed a stoichiometry of 1 .2 mole of lysozyme per mole of VHLYS domain. The dissociation constant 

30 was calculated at about 12 nM. 

[0154] The on-rates for VHLYS and Fv fragments with lysozyme were determined by stopped-flow analysis (HI Tech 
Stop Flow SHU machine) under pseudo-first order conditions with the fragment at a ten fold higher concentration than 
lysozyme [50]. The concentration of lysozyme binding sites was first measured by titration with lysozyme using fluo- 
rescence quench as above. The on rates were calculated per mole of binding site (rather than amount of VHLYS 

35 protein). The on-rate for the Fv fragment was found to be 2.2 x 1 0^ M-"* s"'' at 25°C. The on-rate for the VHLYS fragment 
found to be 3.8 x 10^ M"'' s-*" and the off-rate 0.075 s-"* at 20°C. The calculated affinity constant is 19 nM. Thus the 
VHLYS binds to lysozyme with a dissociation constant of about 19 nM, compared with that of the Fv of 3 nM. 

Example 6 

40 

Cloning complete variable domains with binding activities from mRNA or DNA of antibody-secreting cells 

[0155] A mouse was immunised with hen egg white lysozyme (100 ^.g i.p. day 1 in complete Freunds adjuvant), after 
14 days immunised i.p. again with 100 jxg lysozyme with incomplete Freunds adjuvant, and on day 35 i.v. with 50 jig 
45 lysozyme in saline. On day 39, spleen was harvested. A second mouse was immunised with keyhole limpet haemo- 
cyanin (KLH) in a similar way. The DNA was prepared from the spleen according to Example 2 (Method 2). The VH 
genes were amplified according to the preferred method in Example 2. 

[0156] Human peripheral blood lymphocytes from a patient infected with HIV were prepared as in Example 3 (Method 
2) and mRNA prepared. The VH genes were amplified according to the method described in Example 3, using primers 
so designed for human VH gene families. 

[0157] After the PGR, the reaction mixture and oil were extracted twice with ether, once with phenol and once with 
phenol/CHCl3. The double stranded DNA was then taken up in 50 \i\ of water and frozen. 5 \i\ was digested with PstI 
and BstEII (encoded within the amplification primers) and loaded on an agarose gel for electrophoresis. The band of 
amplified DNA at about 350 bp was extracted. 

55 

Expression of anti-lysozyme activities 

[0158] The repertoire of amplified heavy chain variable domains (from mouse immunised with lysozyme and from 
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human PBLs) was then cloned directly into the expression vector pSW1 HPOLYMYC. This vector is derived from pSW1 
except that the VHLYS gene has been removed and replaced by a polylinker restriction site. A sequence encoding a 
peptide tag was Inserted (Figure 15). Colonies were toothpicked into 1 ml cultures. After induction (see Example 5 for 
details), 1 0 of the supematant from fourteen 1 ml cultures was loaded on SDS-PAGE gels and the proteins transferred 

5 electrophoretically to nitrocellulose. The blot was probed with antibody 9E10 directed against the peptide tag. 

[0159] The probing was undertaken as follows. The nitrocellulose filter was incubated in 3% bovine serum albumin 
(BSA)/TBS buffer for 20 min (10 x TBS buffer is 100 mM Tris.HCI, pH 7.4, 9% w/v NaCI). The filter was incubated in 
a suitable dilution of antibody 9E10 (about 1/500) in 3% BSA/TBS for 1 - 4 hrs. After three washes in TBS (100 ml per 
wash, each wash for 1 0 min), the filter was incubated with 1 :500 dilution of anti-mouse antibody (peroxidase conjugated 

10 anti-mouse Ig (Dakopats)) in 3% BSA/TBS for 1 - 2 hrs. After three washes in TBS and 0.1% Triton X-100 (about 100 
ml per wash, each wash for 10 min), a solution containing 10 ml chloronapthol in methanol (3 mg/ml), 40 ml TBS and 
50 ^1 hydrogen peroxide solution was added over the blot and allowed to react for up to 10 min. The substrate was 
washed out with excess water. The blot revealed bands similar in mobility to VHLYSMYC on the Western blot, showing 
that other VH domains could be expressed. 

15 [0160] Colonies were then toothpicked individually into wells of an ELISA plate (200 for growth and induction. 
They were assayed for lysozyme binding with the 9E1 6 antibody (as in Examples 5 and 7). Wells with lysozyme-binding 
activity were identified. Two positive wells (of 200) were identified firom the amplified mouse spleen DNA and one well 
from the human cDNA. The heavy chain variable domains were purified on a column of lysozyme-Sepharose. The 
affinity for lysozyme of the clones was estimated by fluorescence quench titration as >50nM, The affinities of the two 

20 clones {VH3 and VH8) derived from the mouse genes were also estimated by stop flow analysis (ratio of kgn/koff) as 
1 2 nM and 27 nM respectively. Thus both these clones have a comparable affinity to the VHLYS domain. The encoded 
amino acid sequences of of VH3 and VH8 are given in Figure 16, and that of the human variable domain in Figure 17. 
[0161] A library of VH domains made from the mouse immunised with lysozyme was screened for both lysozyme 
and keyhole limpet haemocyanin (KLH) binding activities. Two thousand colonies were toothpicked in groups of five 

25 into wells of ELISA plates, and the supernatants tested for binding to lysozyme coated plates and separately to KLH 
coated plates. Twenty one supernatants were shown to have lysozyme binding activities and two to have KLH binding 
activities. A second expression library, prepared from a mouse immunised with KLH was screened as above. Fourteen 
supernatants. had KLH binding activities and a single supernatant had lysozyme binding activity. 
[0162] This shows that antigen binding activities can be prepared from single VH domains, and that immunisation 

30 facilitates the isolation of these domains. 

Example 7 

Cloning variable domains with binding activities by mutagenesis. 

35 

[0163] Taking a single rearranged VH gene, it may be possible to derive entirely new antigen binding activities by 
extensively mutating each of the CDRs. The mutagenesis might be entirely random, or be derived from pre-existing 
repertoires of CDRs. Thus a repertoire of CDR3s might be prepared as in the preceding examples by using "universal" 
primers based in the flanking sequences, and likewise repertoires of the other CDRs (singly or in combination). The 
40 CDR repertoires could be stitched into place in the flanking framework regions by a variety of recombinant DNA tech- 
niques. 

[0164] CDRS appears to be the most promising region for mutagenesis as CDR3 is more variable in size and se- 
quence than CDRs 1 and 2. This region would be expected to make a major contribution to antigen binding. The heavy 
chain variable region (VHLYS) of the anti-lysozyme antibody D1 .3 is known to make several important contacts in the 
45 CDR3 region. 

[01 65] Multiple mutations were made in CDR3. The polymerase chain reaction (PCR) and a highly degenerate primer 
were used to make the mutations and by this means the original sequence of CDRS was destroyed. (It would also have 
been possible to construct the mutations in CDRS by cloning a mixed oligonucleotide duplex into restriction sites flanking 
the CDR or by other methods of site-directed mutagenesis). Mutants expressing heavy chain variable domains with 

50 affinities for lysozyme were screened and those with improved affinities or new specificities were identified. 

[0166] The source of the heavy chain variable domain was an M13 vector containing the VHLYS gene. The body of 
the sequence encoding the variable region was amplified using the polymerase chain reaction (PCR) with the mutagenic 
primer VHMUT1 based in CDRS and the MIS primer which is based in the MIS vector backbone. The mutagenic primer 
hypermutates the central four residues of CDRS (Arg-Asp-Tyr-Arg). The PCR was earned out for 25 cycles on a Techne 

55 PHC-1 programmable heat block using 100 ng single stranded M13mp19SW0 template, with 25 pmol of VHMUT1 and 
the M13 primer, 0.5 mM each dNTP, 67mM Tris.HCI, pH 8.8, 10 mM MgCI2, 17 mM (NH4)2S04, 200 [ig/ml gelatine 
and 2.5 units Taq polymerase in a final volume of 50 pJ. The temperature regime was 95°C for 1.5 min, 25°C for 1,5 
min and 72°C for 3 min (However a range of PCR conditions could be used). The reaction products were extracted 
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with phenol/chloroform, precipitated with ethanol and resuspended in 10 mM Iris. HCl and 0.1 nnM EDTA, pH 8.0. 
[0167] The products from the PGR were digested with PstI and BstEII and purified on a 1.5% LGT agarose gel in 

Tris acetate buffer using Geneclean (Bio 101, LaJolla). The gel purified band was ligated into pSW2HP0LY (Figure 
19). (This vector is related to pSW2 except that the body of the VHLYS gene has been replaced by a polylinker.) The 
5 vector was first digested with BstEII and PstI and treated with calfintestinal phosphatase. Aiiquots of the reaction mix 
were used to transform E. coii BMH 71-18 to ampicillin resistance. Colonies were selected on ampicillin (100 \iglm\) 
rich plates containing glucose at 0.8% w/v. 

[0168] Colonies resulting from transfection were picked in pools of five into two 96 well Corning microtitre plates, 
containing 200 jxl 2 x TY medium and 100 |xl TY medium, 100 |ug/ml ampicillin and 1% glucose. The colonies were 
10 grown for 24 hours at 37°C and then cells were washed twice in 200 50 mM NaCI, pelleting the cells In an lEC 
Centra-3 bench top centrifuge with microtitre plate head fitting. Plates were spun at 2,500 rpm for 10 min at room 
temperature. Cells were resuspended in 200 ^1 2 x TY, 100 ^ig/ml ampicillin and 1 mM IPTG (Sigma) to induce expres- 
sion, and grown for a further 24 hr. 

[0169] Cells were spun down and the supematants used in ELISA with lysozyme coated plates and anti-idiotypic 
15 sera (raised in rabbits against the Fv fragment of the D1.3 antibody). Bound anti-idiotypic serum was detected using 
horse radish peroxidase conjugated to anti-rabbit sera (ICN Immunochemicals). Seven of the wells gave a positive 
result in the ELISA. These pools were restreaked for single colonies which were picked, grown up, Induced in microtitre 
plates and rescreened in the ELISA as above. Positive clones were grown up at the 50 ml scale and expression was 
induced. Culture supernatants were purified as in Example 5 on columns of lysozyme-Sepharose and etuates analysed 
20 on SDS-PAGE and staining with Page Blue 90 (BDH). On elution of the column with diethylamine, bands corresponding 
to the VHLYS mutant domains were identified, but none to the VKLYS domains. This suggested that although the 
mutant domains could bind to lysozyme, they could no longer associate with the VKYLS domains. 
[0170] For seven clones giving a positive reaction In ELISA, plasmids were prepared and the. VKLYS gene excised 
by cutting with EcoRI and religating. Thus the plasmids should only direct the expression of the VHLYS mutants. 1.5 
25 ml cultures were grown and induced for expression as above. The cells were spun down and supernatant shown to 
bind lysozyme as above. (Alternatively the amplified mutant VKLYS genes could have been cloned directly into the 
pSWI HPOLY vector for expression. of the mutant activities in the absence of VKLYS.) 

[0171] An ELISA method was devised In which the activities of bacterial supernatants for binding of lysozyme (or 
KLH) were compared. Firstly a vector was devised for tagging of the VH domains at its C4erminal region with a peptide 
30 from the c-myc protein which is recognised by a monoclonal antibody 9E10. The vector was derived from pSWI by a 
BstEII and Smal double digest, and ligation of an oligonucleotide duplex made from 

5* GTC ACC GTC TCC TCA GAA CAA AAA CTC ATC TCA GAA GAG GAT 
CTG AAT TAA TAA 3» and 


5« TTA TTA ATT CAG ATC CTC TTC TGA GAT GAG TTT TTG TTC TGA 
40 G6A 6AC G 3*. 


The VHLYSMYC protein domain expressed after induction was shown to bind to lysozyme and to the 9E10 antibody 
by ELISA as follows: 

45 

(1 ) Falcon (3912) flat bottomed wells were coated with 180 ^il lysozyme (3 mg/ml) or KLH (50 p-g/ml) per well in 
50 mM NaHC03, pH 9.6, and left to stand at room temperature overnight; 

(2) The wells were washed with PBS and blocked for 2 hrs at 37°C with 200 jiU 2% Sainsbury's instant dried 
skimmed milk powder in PBS per welt; 

so (3) The Blocking solution was discarded, and the walls washed out with PBS (3 washes) and 150 ^.1 test solution 

(supernatant or purified tagged domain) pipetted into each well. The sample was incubated at 37°C for 2 hrs; 
(4) The test solution was discarded, and the wells washed out with PBS (3 washes). 100 )xl of 4 )xg/ml purified 
9E10 antibody in 2% Sainsbury's instant dried skimmed milk powder in PBS was added, and incubated at 37''C 
for 2 hrs; 

55 (5) The 9E10 antibody was discarded, the wells washed with PBS (3 washes). 100 ^l of 1/500 dilution of anti- 

mouse antibody (peroxidase conjugated anti-mouse Ig (Dakopats)) was added and Incubated at 37"C for 2 hrs; 

(6) The second antibody was discarded and wells washed three fimes with PBS; and 

(7) 100 \i\ 2,2'azino-bis(3-ethylbenzthiazolinesulphonicacid) [Sigma] (0.55 mg/ml, with 1 ^ 20% hydrogen perox- 
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ide: water per 10 ml) was added to each well and the colour allowed to develop for up to 10 minutes at room 
temperature. 

[0172] The reaction was stopped by adding 0.05% sodium azide in 50 mM citric acid, pH 4.3. ELISA plates were 

5 read in an Titertek Multiscan plate reader. 

[0173] The activities of the mutant supernatants were compared with VHLYS supernatant by competition with the 
VHLYSMYC domain for binding to lysozyme. The results show that supernatant from done VHLYSMUT59 is more 
effective than wild type VHLYS supernatant in competing for VHLYSMYC. Furthermore, Western blots of SDS-PAGE 
aliquots of supernatant from the VHLYS and VHLYSMUT59 domain (using anti-Fv antisera) indicated comparable 

10 amounts of the two samples. Thus assuming identical amounts of VHLYS and VHLYSMUT59, the affinity of the mutant 
appears to be greater than that of the VHLYS domain. 

[0174] To check the affinity of the VHLYSMUT69 domain directly, the clone was grown at the 1 1 scale and 200-300 
\ig purified on lysozyme-Sepharose as in Example 5. By fluorescence quench titration of samples of VHLYS and 
VHLYSMUT59, the number of binding sites for lysozyme were determined. The samples of VHLYS and VHLYSMUT59 

15 were then compared in the competition ELISA with VHLYSMYC over two orders of magnitude. In the competition assay 
each microtitre well contained a constant amount of VHLYSMYC (approximately 0.6 fxg VHLYSMYC). Varying amounts 
of VHLYS or VHLYSMUT59 (3.8 fiM in lysozyme binding sites) were added (0.166 - 25 The final volume and buffer 
concentration in all wells was constant. 9E10 (anti-myc) antibody was used to quantitate bound VHLYSMYC in each 
assay well. The % inhibition of VHLYSMYC binding was calculated for each addition of VHLYS or VHLYSMUT59, after 

20 subtraction of background binding. Assays were carried out in duplicate. The results indicate that VHLYSMUT59 has 
a higher affinity for lysozyme than VHLYS. 

[0175] The VHLYSMUT59 gene was sequenced (after recloning into M13) and shown to be identical to the VHLYS 
gene except for the central residues of CDR3 (Arg-Asp-Tyr-Arg). These were replaced by Thr-Gln-Arg-Pro: (encoded 
by ACACAAAGGCCA). 

25 [0176] A library of 2000 mutant VH clones was screened for lysozyme and also for KLH binding (toothpicking 5 
colonies per well as described in Example 6). Nineteen supematants were identified with lysozyme binding activities 
and four with KLH binding activities. This indicates that new specificites and improved affinities can be derived by 
making a random repertoire of CDR3. 

30 Example 8 

Construction and expression of double domain for lysozyme binding. 

[0177] The finding that single domains have excellent binding activities should allow the construction of strings of 
35 domains (concatamers). Thus, multiple specificities could be built into the same molecule, allowing binding to different 
epitopes spaced apart by the distance between domain heads. Flexible linker regions could be built to space out the 
domains. In principle such molecules could be devised to have exceptional specificity and affinity. 
[0178] Two copies of the cloned heavy chain variable gene of the D1 .3 antibody were linked by a nucleotide sequence 
encoding a flexible linker 

40 

Qly-Qly-Gly-Ala-Pro-Ala-Ala-Ala-Pro-Ala-Gly-Gly-Gly- 

(by several steps of cutting, pasting and site directed mutagenesis) to yield the plasmid pSW3 (Figure 20). The ex- 
45 pression was driven by a lacz promoter and the protein was secreted into the periplasm via a pelB leader sequence 
(as described in Example 5 for expression of pSW1 and PSW2). The protein could be purified to homogeneity on a 
lysozyme affinity column. On SDS polyacrylamide gels, it gave a band of the right size (molecular weight about 26,000). 
The protein also bound strongly to lysozyme as detected by ELISA (see Example 5) using anti-idiotypic antiserum 
directed against the Fv fragment of the D1.3 antibody to detect the protein. Thus, such constructs are readily made 
so and secreted and at least one of the domains binds to lysozyme. 

Example 9 

Introduction of cysteine residue at C-termlnal end of VHLYS 

55 

[0179] A cysteine residue was introduced at the C-temiinus of the VHLYS domain in the vector pSW2. The cysteine 
was introduced by cleavage of the vector with the restriction enzymes BstI and Smal (which excises the C-terminal 
portion of the J segment) and ligation of a short oligonucleotide duplex 
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5' 6TC ACC GTC TCC TCA TGT TAA TAA 3' 

and 

5 

5' TTA TTA ACA TGA GGA GAC G 3-. 

By purification on an affinity column of lysozyme Sepharose it was shown that the VHLYS-Cys domain was expressed 
in association with the VKLYS variable domain, but the overall yields were much lower than the wild type Fv fragment. 
10 Comparison of non-reducing and reducing SDS polyacrylamide gels of the purified Fv-Cys protein indicated that the 
two VH-Cys domains had become linked through the Introduced cysteine residue. 

Example 10 

15 Linking of VH domain with enzyme 

[0180] Linking of enzyme activities to VH domains should be possible by either cloning the enzyme on either the N- 
terminal or the C-terminal side of the VH domain. Since both partners must be active, it may be necessary to design 
a suitable linker (see Example 8) between the two domains. For secretion of the VH-enzyme fusion, it would be pref- 
20 erable to utilise an enzyme which Is usually secreted. In Figure 21 , there Is shown the sequence of a fusion of a VH 
domain with alkaline phosphatase. The alkaline phosphatase gene was cloned from a plasmid carrying the E. coli 
alkaline phosphatase gene in a plasmid pEK48 [51] using the polymerase chain reaction. The gene was amplified with 
the primers 

5» CAC CAC GGT CAC CGT CTC CTC ACG GAC ACC AGA AAT GCC TGT 
TCT G 3« and 

30 5* GCG AAA ATT CAC TCC CGG GCG CGG TTT TAT TTC 3'.. 


The gene was introduced into the vector pSW1 by cutting at BstEII and Smal. The construction (Figure 21) was ex- 
pressed in E. coli strain BMH71-18 as in Example 5 and screened for phosphatase activity using 1 mg/ml p-nitrophe- 
35 nylphosphate as substrate In lOmM diethanoiamlne and 0.5 mM MgC|2, pH 9.5) and also on SDS polyacrylamide geis 
which had been Western blotted (detecting with anti-idiotypic antiserum). No evidence was found for the secretion of 
the linked VHLYS-alkaline phosphatase as detected by Western blots (see Example 5), or for secretion of phosphatase 
activity. 

[0181] However when the construct was transfected into a bacterial strain BL21DE3 [52] which is deficient in pro- 
^0 teases, a band of the conrect size (as well as degraded products) was detected on the Western blots. Furthermore 
phosphatase activity could now be detected in the bacterial supernatant. Such activity is not present in supernatant 
from the strain which had not been transfected with the construct. 

[01 82] A variety of linker sequences could then be introduced at the BstEII site to improve the spacing between the 
two domains. 

45 

Example 11 

Coexpresslon of VH domains with Vk repertoire 

50 [0183] A repertoire of Vk genes was derived by PGR using primers as described in Example 2 from DNA prepared 
from mouse spleen and also from mouse spleen mRNA using the primers VK3F0R and VK2BACK and a cycle of 94°C 
for 1 min, 60 °C for 1 min, 72°C for 2 min. The PGR amplified DNA was fractionated on the agarose gel, the band 
excised and cloned into a vector which carries the VHLYS domain (from the D1.3 antibody), and a cloning site (Sac! 
and Xhol) for cloning of the light chain variable domains with a myc tail (pSWI VHLYS-VKPOLYMYC, Figure 22). 

55 [0184] Clones were screened for lysozyme. binding activities as described in Examples 5 and 7 via the myc tag on 
the light chain variable domain, as this should permit the following kinds of Vic domains to be Identified: 

(1 ) those which bind to lysozyme In the absence of the VHLYS domain; 
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(2) those which associate with the heavy chain and make no contribution to binding of lysozyme; and 

(3) those which associate with the heavy chain and also contribute to binding of lysozyme (either helping or hin- 
dering). 

5 [0185] This would not identify those Vk domains which associated with the VHLYS domain and completely abolished 
its binding to lysozyme. 

[0186] In a further experiment, the VHLYS domain was replaced by the heavy chain variable domain VH3 which had 
been isolated from the repertoire (see Example 6), and then the Vk domains cloned into the vector. (Note that the VH3 
domain has an internal Sad site and this was first removed to allow the cloning of the Vk repertoire as Sacl-Xhol 
10 fragments.) 

[0187] By screening the supernatant using the ELISA described in Example 6, bacterial supernatants will be identified 
which bind lysozyme. 

Example 12 

IS 

High expression of VH domains. 

[0188] By screening several clones from a VH library derived from a mouse immunised with lysozyme via a Western 
blot, using the 9E10 antibody directed against the peptide tag, one clone was noted with very high levels of expression 
20 of the domain (estimated as 25 - 50 mg/l). The clone was sequenced to detennine the nature of the sequence. The 
sequence proved to be closely related to that of the VHLYS domain, except with a few amino acid changes (Figure 
23). The result was unexpected, and shows that a limited number of amino acid changes, perhaps even a single amino 
acid substitution, can cause greatly elevated levels of expression. 

[0189] By making mutations of the high expressing domain at these residues, it was found that a single amino acid 
25 change in the VHLYS domain(Asn 35 to His) is sufficient to cause the domain to be expressed at high levels. 

CONCLUSION 

[0190] It can thus be seen that the present invention enables the cloning, amplification and expression of heavy and 
30 light chain variable domain encoding sequences in a much more simple manner than was previously possible. It also 
shows that isolated variable domains or such domains linked to effector molecules are unexpectedly useful. 
[0191] It will be appreciated that the present invention has been described above by way of example only and that 
variations and modifications may be made by the skilled person without departing from the scope of the invention. 
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35 Claims 

1. A method of cloning sequences (target sequences) each containing a sequence encoding at least part of an im- 
munoglobulin variable domain, which method comprises providing a sample repertoire of nucleic acid containing 
target sequences, and using forward and back primers in the process of copying and cloning of the target sequenc- 
40 es for expression of a repertoire of proteins each comprising at least part of an immunoglobulin variable domain, 

the forward primer being specific for a sequence at or adjacent the 3' end of the sense strand of each of the target 
sequences, the back primer being specific for a sequence at or adjacent the 3' end of the antisense strand of each 
of the target sequences. 

45 2. A method according to claim 1 which method comprises: 

(a) providing a sample repertoire of double-stranded nucleic acid containing target sequences; 

(b) causing the two strands of the doubled-stranded nucleic acid to be separated; 

(c) annealing to the sample a fonvard and a back oligonucleotide primer, the fonvard primer being specific for 
.50 a sequence at or adjacent the 3* end of the sense strand of each of the target sequences, the back primer 

being specific for a sequence at or adjacent the 3' end of the antisense strand of each of the target sequences, 
under conditions which allow the primers to hybridize specifically to the nucleic acid; 

(d) treating the annealed sample with a DNA polymerase enzyme in the presence of deoxynudeoside triphos- 
phates under conditions which cause primer extension to take place, thereby producing double-stranded nu- 

55 cleic acid; 

(e) repeating steps (b) to (d), thereby producing some double-stranded DNA (product DNA) containing only 
the target sequences; 

(f) cloning product DNA into expression vectors for expression of a repertoire of proteins each comprising at 
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least part of an immunoglobulin variable domain. 
3. A method according to claim 2 wherein steps (b) to (d) are repeated a plurality of times before step (f). 
5 4. A method according to claim 1 , which comprises: 

(a) providing a repertoire of mRNA; 

(b) annealing to the mRNA an oligonucleotide primer specific for a sequence at or adjacent the 3' end of each 
of the target sequences on the sense strands, under conditions which allow the primer to hybridize specifically 

10 to the nucleic acid; 

(c) treating the primer-annealed mRNA with a polymerase enzyme in the presence of deoxynucleoside tri- 
phosphates under conditions which cause primer extension to take place, thereby producing antisense cDNA; 

(d) annealing to the cDNA an oligonucleotide primer specific for a sequence at or adjacent the 3' end of each 
of the target sequences on the antisense strands, under conditions which allow the primer to hybridize spe- 

15 cifically to the nucleic acid; 

(e) treating the primer-annealed cDN A with a polymerase enzyme in the presence of deoxynucleoside triphos- 
phates under conditions which cause primer extension to take place, thereby producing double-stranded DNA 
(product DNA); 

(f) cloning product DNA into expression vectors for expression of a repertoire of proteins each comprising at 
20 least part of an immunoglobulin variable domain. 

5. A method according to claim 4 wherein, after step (e) the following steps are performed before step (f): 

(i) causing the two strands of the product DNA to be separated; 

25 (ii) annealing to the separated strands a forward and a back oligonucleotide primer, the forward primer being 

specific for a sequence at or adjacent the 3' end of the sense strand of each of the target sequences, the back 
primer being specific for a sequence at or adjacent the 3' end of the antisense strand of each of the target 
sequences, under conditions which allow the primers to hybridize specifically to the nucleic acid; 
(iii) treating the annealed sample with a DNA polymerase enzyme in the presence of deoxynucleoside triphos- 

30 phates under conditions which cause primer extension to take place, thereby producing double-stranded nu- 

cleic acid: 

6. A method according to any one of claims 1 to 5 wherein the back primer is specific for a sequence at or adjacent 
the 3' end of the antisense strand of the sequences which are contained in the target sequences and which each 

35 encode at least part of an immunoglobulin variable domain. 

7. A method according to any one of claims 1 to 6 wherein the sample repertdre of double-stranded nucleic acid is 
derived from lymphocytes. ^ 

^0 8. A method according to claim 7 wherein the lymphocytes are derived from an animal or human mounting an immune 
response to an antigen. 

9. A method according to claim 7 wherein the lymphocytes are derived from a patient with an auto-immune disease. 

45 10. A method according to claim 1 wherein the sample repertoire of nucleic acid is derived from rearranged immu- 
noglobulin variable region genes. 

1 1 . A method according to claim 1 wherein the sample repertoire of nucleic acid is genomic DNA. 

50 12. A method according to claim 1 wherein the sample repertoire of nucleic acid is derived from unrearranged immu- 
noglobulin variable region genes. 

13. A method according to any one of claims 1 to 12 wherein the target sequence contains a sequence encoding a 
variable domain from an immunoglobulin heavy chain. 

55 

14. A method according to claim 13 wherein the product DNA is inserted into an expression vector for expression of 
single domain ligands selectable by their binding affinity for antigen. 
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15. A method according to any one of claims 1 to 13 wherein product DNA is inserted into an expression vector for 
expression of antibodies or antibody fragments selectable by their binding affinity for antigen. 

16. A method according to any one of claims 1 to 13 wherein the product DNA is inserted Into an expression vector 
5 for expression alone. 

17. The method of any one of claims 1 to 13 wherein the product DNA is inserted into an expression vector for ex- 
pression in combination with a complementary variable domain. 

10 18. A method according to any one of claims 1 to 13 wherein the product DNA is inserted into an expression vector 
already containing sequences encoding one or more constant domains for expression. 

19. A method according to any one of claims 1 to 13 wherein the product DNA is inserted into an expression vector 
for expression as fusion proteins. 

16 

20. A method according to any one of claims 1 to 13 wherein the product DNA is inserted into an expression vector 
for expression with peptide tags. 

21. A method according to any one of claims 1 to 13 wherein product DNA containing sequences encoding at least 
20 immunoglobulin heavy chain variable domains and product DNA containing sequences encoding at least immu- 

noglobulin light chain variable domains are inserted into expression vectors for expression of a combinatorial 
repertoire of complementary variable domains. 

22. A method according to claim 21 wherein the product DNA is inserted into an expression vector already containing 
25 sequences encoding one or more constant domains for expression. 

23. A method according to claim 21 wherein product DNA is inserted into expression vectors for expression as fusion 
proteins. 

30 24. A method according to claim 21 wherein the product DNA is inserted into an expression vector for expression with 
peptide tags. 

25. A method according to any one of claims 1 to 24 wherein the fonward and back primers are provided as single 
oligonucleotides. 

35 

26. A method according'to any one of claims 1 to 24 wherein the fonward primers are supplied as a mixture of oligo- 
nucleotides. 

27. A method according to any one of claims 1 to 24 wherein the back primers are supplied as a mixture of oiigonu- 
40 cleotides. 

28. A method according to any one of claims 1 to 27 wherein each primer includes a sequence encoding a restriction 
enzyme recognition site. 

45 29. A method according to daim 28 wherein the restriction enzyme recognition site is located in the sequence which 
is annealed to the nucleic acid. 

30. A method according to claim 1 wherein the back primer is a general primer useful for cloning a desired antibody 
specificity from a specific species. 

so 

31. A method according to claim 1 wherein the back primer Is a mixture of primers having a variety of sequences 
designed to be complementary to the various families of VH, Vk or V sequences. 

32. An expression library comprising a repertoire of nucleic acid sequences for expression of a repertoire of proteins 
55 each comprising an Immunoglobulin variable domain. 

33. An expression library comprising a repertoire of third CDR sequences, said sequences being located in an other- 
wise invariant VH gene. 
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Patentanspruche 

1 . Verfahren zum Klonieren von Sequenzen (Zielsequenzen), die jeweils eine zumindest fur einen Teil einer variablen 
Immunglobulindomclne kodierende Sequenz enthalten, wobei das Verfahren die Bereitstellung einer Probensamm- 
lung von Zielsequenzen enthaltenden Nucleinsduren und die Verwendung von Vonvdrts- und RQckwSrtspriniern 
im Verfahren des Kopierens und Klonierens der Zielsequenz zur Expression einer Sammlung von Proteinen um- 
fasst. von denen jedes zumindest einen Teil einer variablen ImmunglobulindomSne umfasst, wobei der Vorwdrts- 
primer fur eine Sequenz am Oder benachbart zum 3'-Ende des Sense-Strangs jeder Zielsequenz spezifisch ist 
und der Ruckwdrtsprimer fur eine Sequenz am Oder benachbart zum 3'-Ende des Antisense-Strangs jeder Ziel- 
sequenz spezifisch ist. 

2. Verfahren nach Anspruch 1, wobei das Verfahren Folgendes umfasst: 

(a) Breitstellung einer Probensammlung von Zielsequenzen enthattender doppelstrSngiger NucleinsSure; 

(b) Bewirken einer Trennung der zwei Strange der doppelstrSngigen NucleinsSure; 

(c) Anellieren eines Vorwarts- und eines Ruckwarts-Oligonucleotidprimers an die Probe, wobei der Vorwarts- 
prlmer fur eine Sequenz am Oder benachbart zum 3 -Ende des Sense-Strangs jeder Zielsequenz spezifisch 
ist und der Ruckwartsprimer fur eine Sequenz am Oder benachbart zum 3'-Ende des Antisense-Strangs jeder 
Zielsequenz spezifisch ist, unter Bedingungen, die spezifische Hybridisierung der Primer an die NucleinsSure 
ermGglichen; 

(d) Behandlung der anellierten Probe mit einem DNA-Polymerase-Enzym in Gegenwart eines Desoxynucleo- 
sidtriphosphats unter Bedingungen, die eine Primerextension bewirken, wodurch doppelstrSngige Nuclein- 
sSure erzeugt wird; 

(e) Wiederholung der Schritte (b) bis (d), wodurch doppelstrSngige DNA (Produkt-DNA) erzeugt wird, die nur 
die. Zielsequenzen enthait; 

(f) Klonieren der Produkt-DNA in Expressionsvektoren zur Expression einer Sammlung von Proteinen, von 
denen jedes zumindest einen Teil einer variablen ImmunglobulindomSne umfasst. 

3. Verfahren nach Anspruch 2, worin die Schritte (b) bis (d) vor Schritt (f) mehrmals wiederholt werden. 

4. Verfahren nach Anspruch 1, umfassend: 

(a) die Bereitstellung einer mRNA-Sammlung; 

(b) das Anellieren eines Oiigonucleotidprimers an die mRNA, der fur eine Sequenz am Oder benachbart zum 
3'-Ende des Sense-Strangs jeder Zielsequenz spezifisch ist, unter Bedingungen, die spezifische Hybridisie- 
rung des Primers an die NucleinsSure ermGglichen; 

(c) die Behandlung der mRNA mit anelliertem Primer mit einem Polymerase-Enzym in Gegenwart eines Des- 
oxynucleosidtriphosphats unter Bedingungen, die eine Primerextension bewirken, wodurch Antisense-cDNA 
erzeugt wird; 

(d) das Anellieren eines Oiigonucleotidprimers an die cDNA, der fQr eine Sequenz am oder benachbart zum 
3'-Ende des Antisense-Strangs jeder Zielsequenz spezifisch ist, unter Bedingungen, die spezifische Hybridi- 
sierung des Primers an die Nucleinsaure ermfiglichen; 

(e) die Behandlung der cDNA mit anellierten Primer mit einem Polymerase-Enzym in Gegenwart eines Des- 
oxynucleosidtriphosphats unter Bedingungen, die eine Primerextension bewirken, wodurch doppelstrSngige 
DNA (Produkt-DNA) erzeugt wird; 

(f) das Klonieren der Produkt-DNA in Expressionsvektoren zur Expression einer Sammlung von Proteinen, 
von denen jedes zumindest einen Teil einer variablen ImmunglobulindomSne umfasst. 

5. Verfahren nach Anspruch 4, worin nach Schritt (e) und vor Schritt (f) die folgenden Schritte durchgefuhrt werden: 

(i) Bewirken einer Trennung der zwei StrSnge der Produkt-DNA; 

(ii) Anellieren eines VonvSrts- und eines Ruckwarts-Oligonucleotidprimers an die getrennten Strange, wobei 
der Vorwartsprimer fur eine Sequenz am oder benachbart zum 3 -Ende des Sense-Strangs jeder Zielsequenz 
spezifisch ist und der Ruckwartsprimer fur eine Sequenz am oder benachbart zum 3 -Ende des Antisense- 
Strangs jeder Zielsequenz spezifisch ist, unter Bedingungen, die spezifische Hybridisierung der Primer an die 
Nucleinsaure erm6glichen; 

(iii) Behandlung der anellierten Probe mit einem DNA-Polymerase-Enzym in Gegenwart eines Desoxynucleo- 
sidtrlphosphats unter Bedingungen, die eine Primerextension bewirken, wodurch doppelstrangige Nuclein- 
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sSure erzeugt wird. 

6. Verfahren nach einem der Anspruche 1 bis 5, worin der Ruckwartsprimer fur eine Sequenz am oder benachbart 
zum 3'-Ende des Antisense-Strangs der Sequenzen, die in den Zielsequenzen enthalten sind und die zumindest 

5 einen Teil einer varlablen ImmunglobulindomSne kodieren, spezifisch ist. 

7. Verfahren nach einem der AnsprOche 1 bis 6, worin die Sammlung doppelstrangiger Nucleinsaureproben von 
Lymphozyten stammen. 

10 8. Verfahren nach Anspruch 7, worin die Lymphozyten von einem Menschen oder Tier stammen, der/das eine Im- 
munantwort auf ein Antigen zeigt. 

9. Verfahren nach Anspruch 7, worin die Lymphozyten von einem Patiehten mit einer Autoimmunerkrankung stam- 
men. 

16 

10. Verfahren nach Anspruch 1, worin die Nuclei nsaureprobensammlung von Genen fOr umgeordnete variable Do- 
mSnen von Immunglobulinen stammt. 

11. Verfahren nach Anspruch 1, worin die Nucleinsaureprobensammlung genomische DNA ist. 

20 

12. Verfahren nach Anspruch 1, worin die Nucleinsaureprobensammlung von Genen fQr nicht umgeordnete variable 
DomSnen von Immunglobulinen stammt. 

13. Verfahren nach einem der Anspruche 1 bis 12, worin die Zielsequenz eine Sequenz enthalt, die fur eine variable 
25 DomSne einer schweren Immunglobulinkette kodiert. 

14. Verfahren nach Anspruch 13, worin die Produkt-DNA in einen Expressionsvektor zur Expression von einzelnen 
Dom3nen-Liganden eingefOhrt vk^rd, die nach ihrer Bindungsaffinitdt fOr das Antigen selektierbar sind. 

30 1 5. Verfahren nach einem der Anspruche 1 bis 1 3, worin Produkt-DNA in einen Expressionsvektor zur Expression von 
Antik&rpern oder Fragmenten von Antikdrpem eingefOhrt wird, die nach ihrer Bindungsaffinitdt fUr das Antigen 

selektierbar sind. 

16. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA in einen Expressionsvektor zur alleinigen 
35 Expression eingefOhrt wird. 

17. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA in einen Expressionsvektor zur Expression 
in Kombination mit einer komplementSren variablen DomSne eingefOhrt wird. 

40 18. Verfahren nach einem der Anspruche 1 bis 13, worin die Produkt-DNA zur Expression in einen Expressionsvektor 
eingefOhrt wird, der bereits Sequenzen enthdit, die fOr eine oder mehrere konstante Domdnen kodieren. 

19. Verfahren nach einem der Anspruche 1 bis 1 3, worin die Produkt-DNA in einen Expressionsvektor zur Expression 
ats Fusionsproteine eingefOhrt wird. 

45 

20. Verfahren nach einem der Anspruche 1 bis 1 3, worin die Produkt-DNA in einen Expressionsvektor zur Expression 
mit Peptidmarkierungen eingefOhrt wird. 

21 . Verfahren nach einem der Anspruche 1 bis 13, worin Produkt-DNA, die zumindest fur variable Domanen schwerer 
50 Immungtobulinketten kodierende Sequenzen enth3lt, und Produkt-DNA, die zumindest fOr variable DomSnen leich- 

ter Immungtobulinketten kodierende Sequenzen enthSIt, in Expressionsvektoren zur Expression einer kombina- 
torischen Sammlung komplementSrer variabier DomSnen eingefOhrt werden. 

22. Verfahren nach Anspruch 21 . worin die Produkt-DNA zur Expression in einen Expressionsvektor eingefOhrt wird, 
55 der bereits Sequenzen enthait. die fOr eine oder mehrere konstante Domanen kodieren. 

23. Verfahren nach Anspruch 21 , worin Produkt-DNA in Expressionsvektoren zur Expression als Fusionsproteine ein- 
gefOhrt wird. 
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24. Verfahren nach Anspruch 21, worin die Produkt-DNA in einen Expressionsvektor zur Expression nnit Peptidmar- 
kieaing eingefOhrt wird. 

25. Verfahren nach einem der Anspruche 1 bis 24, worin die VorwSrts- und Ruckwartsprimer als einzelne Oligonu- 
cleotide bereitgestellt werden. 

26. Verfahren nach einem der Anspruche 1 bis 24, worin die VonA/Srtsprimer als Oligonucleotidgemische bereitgestellt 
werden. 

27. Verfahren nach einenn der Ansprtlche 1 bis 24, worin die RQckwSrtsprimer als Oligonucleotidgemische bereitge- 
stellt werden. 

28. Verfahren nach einem der AnsprOche 1 bis 27, worin jeder Primer eine Sequenz enthait, die fur eine Restriktions- 
enzym-Erkennungsstelle kodiert. 

29. Verfahren nach Anspruch 28, worin sich die Restrlktlonsenzym-Erkennungsstelle In der an die Nudeinsdure an- 
ellierten Sequenz befindet. 

30. Verfahren nach Anspruch 1 , worin der Ruckwartsprimer ein allgemeiner Primer ist, der zum Klonieren einer ge- 
wunschten Antikorperspezifitat einer spezifischen Spezies dient. 

31 . Verfahren nach Anspruch 1 , worin der Ruckwartsprimer ein Gemisch von Primern mit einer Vielfalt von Sequenzen 
ist, die so konzipiert sind, dass sie komplementar zu den verschiedenen Familien von VH-, Vk- oder V-Sequenzen 
sind. 

32. Expressionsbibliothek, die ein Repertoire von NucleinsSuresequnzen zur Expression eines Repertoires von Pro- 
teinen umfasst, die jeweils eine variable Immunglobulindomane umfassen. 

33. Expressionsbibliothek, die ein Repertoire an dritten CDR-Sequenzen umfasst, wobei sich die Sequenzen in einem 
ansonsten invarianten VH-Gen befinden. 


Revendications 

1 . M^thode de clonage de sequences (sequences clbles), chacune contenant une sequence codant au moins une 
partie d'un domaine variable d'immunoglobuline, taquelle m^thode consiste a produire un repertoire d'^chantillons 
de sequences cibles contenant de I'acide nucl^ique et d utiliser des amorces vers I'avant et vers Tarri^re dans le 
processus de copiage et de clonage des sequences cibles pour I'expression d'un repertoire des prot6ines, chacune 
comprenant au moins une partie d'un domaine variable d'immunoglobuline, I'amorce vers I'avant 6tant sp6cifique 
d'une sequence a ou prds de Textrdmite 3' du brin de sens de chacune des sequences cibles, I'amorce vers rarri^re 
etant spedfique d'une sequence d ou adjacente d Textr^mite 3' du brin anti-sens de chacune des sequences cibles. 

2. Methode seion la revendication 1, laquelle consiste d: 

(a) pr6voir un repertoire d'echantillons de sequences cibles contenant de I'acide nucieique d deux brins; 

(b) forcer les d eux brins de I'acide nucieique e deux brins a se s^parer; 

(c) recuire, sur rechantillon, une amorce d'oligonucl6otides vers I'avant et vers rarri6re, I'amorce vers I'avant 
etant specifique d'une sequence e ou d proximite de i'extremite 3' du brin de sens de chacune des sequences 
cibles, I'amorce vers I'arriere etant specifique d'une sequence e ou adjacente ci I'extremite 3' du brin de I'anti- 
sens de chacune des sequences cibles, dans des conditions qui permettent aux amorces de s'hybrider spe- 
cifiquement d I'acide nucieique; 

(d) trailer rechantillon recuit avec une enzyme d'ADN polymerase en presence de desoxynucieoside triphos- 
phates dans des conditions qui provoquent Textension de I'amorce pour ainsi produire I'acide nucieique d deux 
brins; 

(e) repeter les etapes (b) et (d) pour ainsi produire de I'ADN d deux brins (ADN produit) ne contenant que les 

sequences cibles; 

(f) doner I'ADN produit dans des vecteurs d'expression pour I'expression d'un repertoire de proteines dont 
chacune comprend au rnoins une par tie d'un domaine variable d'immunoglobuline. 
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3. M^thode selon la revendication 2, oCi les stapes (b) et (d) sont r6pdt6es un certain nombre de fois avant I'^tape (f). 

4. M^thode selon la revendication 1 , qui consiste d: 

5 (a) produire un repertoire d'ARNm; 

(b) recuire, sur TARNm, une amorce d'oligonucl6otides sp6cifique d'une s6quence d ou adjacente d rextr6mit6 
3' de chacune des sequences cibles sur les brins de sens, dans des conditions qui pennettent d Tamorce de 
s'hybrider sp6cifiquement d I'acide nucl^ique; 

(c) traiter I'ARNm recuit d I'amorce par une enzyme polymerase en presence de d^soxynucieoside triphos- 
10 phates dans des conditions qui provoquent I'extension de I'amorce pour ainsi produire I'ADNc anti-sens; 

(d) recuire, d I'ADNc, une amorce d'oligonucieotides sp^cifique d'une sequence d ou adjacente d Textr^mite 
3' de chacune des sequences cibles sur les brins anti -sens, dans des conditions qui permettent d Tamorce 
de s'hybrider sp6cifiquement d I'acide nucl6ique; 

(e) traiter I'ADNc recuit d I'amorce par une enzyme polymerase en presence de d6soxynud6oside triphos- 
15 phates dans des conditions qui provoquent I'extension de Tamorce, pour ainsi produire de I'ADN d deux brins 

(ADN prodult); 

(f) doner I'ADN produit dans des vecteurs d'expression pour ('expression d'un repertoire de proteines, chacune 
comprenant au moins une partie d'un domaine variable d'immunoglobuline. 

20 5. l\/iethode selon la revendication 4 oCi, apres retape (e), on accomplit, avant I'etape (0, les etapes suivantes: 

(i) provoquer la separation des deux brins de I'ADN produit; 

(ii) recuire, aux brins s6pares, une amorce d'oligonucl6otides vers I'avant et vers I'arri^re, I'amorce vers I'avant 
etant specifique d'une sequence e ou adjacente d I'extremite 3' du brin de sens de chacune des sequences 

25 cibles, I'amorce vers I'arhere etant specifique d'une sequence e ou adjacente e I'extremite 3' du brin d'anti- 

sens de chacune des sequences cibles, dans des conditions qui permettent aux amorces de s'hybrider spe- 
cifiquement d I'acide nucieique; 

(iii) traiter I'echantillon recuit avec une enzyme d'ADN polymerase en presence de desoxynucieoside triphos- 
phates dans des conditions qui provoquent une extension de I'amorce pour ainsi produire I'acide nucieique d 

30 deux brins. 

6. Methode selon I'une queiconque des revendications 1 d 5, ou I'amorce vers I'arhere est specifique d'une sequence 
d ou adjacente d I'extremite 3' du brin d'anti-sens des sequences qui sont contenues dans les sequences cibles 
et dont chacune code au moins une partie d'un domaine variable d'immunoglobuline. 

35 

7. Methode selon I'une queiconque des revendications de 1 d 6, ou le repertoire d'echantillons de I'acide nucieique 
^ deux brins est derive de lymphocytes. 

8. Methode selon la revendication 7, ou les lymphocytes sont derives d'un animal ou humain montrant une reponse 
40 immune e un antigene. 

9. Methode selon la revendication 7, ou les lymphocytes sont derives d'un patient presentant une maladie auto- 
Immune. 

45 10. Methode selon la revendication 1 , oCj le repertoire d'echantillons de I'acide nucieique est derive de genes de regions 
variables d'immunoglobuline qui sont rearrangees. 

11. Methode selon la revendication 1, oCi le repertoire d'echantillons de I'acide nucieique est I'ADN genomique. 

50 1 2. Methode selon la revendication 1 , ou le repertoire d'echantillons de I'acide nucieique est derive de genes de regions 
variables d'immunoglobuline qui ne sont pas rearrangees. 

13. Methode selon I'une queiconque des revendications 1 e 12, ou la sequence cible contient une sequence codant 
un domaine variable d'une chaine lourde d'immunoglobuline. 

55 

14. Methode selon la revendication 13, ou I'ADN produit est insere dans un vecteur d'expression pour I'expression de 
ligands de domaine simple pouvant etre seiectionnes par leur affinite de liaison pour I'antigene. 
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15. M6thode selon Tune quelconque des revendicatlons 1 d 13, oCi I'ADN produit est ins6r6 dans un vecteur d'expres- 
sion pour I'expression des anticorps ou des fragments d'anttcorps pouvant 6tre s6lectionn6s par ieur affinity de 
liaison pour I'antig^ne. 

5 16. MSthode seion I'une quelconque des revendicatlons 1 d 1 3, oCi I'ADN produit est ins6r6 dans un vecteur d'expres- 
sion pour I'expression seule. 

17. M6thode selon I'une quelconque des revendicatlons 1 6 13, ou I'ADN produit est ins6r6 dans un vecteur d'expres- 
ston pour I'expression en combinaison avec un domaine variable compl6mentaire. 

to 

18. M6thode selon I'une quelconque des revendicatlons 1 d 13, ou I'ADN produit est ins6r6 dans un vecteur d'expres- 
slon contenant d6j^ des sequences codant un ou plusieurs domaines constants pour I'expression. 

19. M^thode selon I'une quelconque des revendicatlons 1 d 13, ou I'ADN produit est ins^r^ dans un vecteur d'expres- 
15 sion pour I'expression en tant que prot^ines de fusion. 

20. M^thode selon I'une quelconque des revendicatlons 1 d 13, ou I'ADN produit est Ins^r^ dans un vecteur d'expres- 
sion pour I'expression avec des marqueurs de peptides. 

20 21 . M6thode selon I'une quelconque des revendicatlons 1 d 1 3, ou I'ADN produit contenant des sequences codant au 
moins des domaines variables de chatne lourde d'immunoglobuline et I'ADN produit contenant des sequences 
codant au molns des domaines variables de chaTne l^g^re d'immunoglobuline sent ins^r^s dans des vecteurs 
d'expression pour I'expression d'un repertoire en combinaison de domaines variables compldmentaires. 

25 22. Mdthode selon la revendication 21, ou I'ADN produit est ins^rd dans un vecteur d'expression contenant d6jd des 
sequences codant un ou plusieurs domaines constants pour {'expression. 

23. M^thode selon la revendication 21 , oCi I'ADN produit est ins^rd dans des vecteurs d'expression pour I'expression 
en tant que prot6ines de fusion. 

30 

24. M^thode selon la revendication 21, oCi I'ADN produit est ins6r6 dans un vecteur d'expression pour {'expression 
avec des marqueurs de peptides. 

25. M6thode selon I'une quelconque des revendicatlons 1 ^ 24, ou les amorces vers I'avant et vers Tan-i^re sont 
35 produites sous la forme d'oligonucl6otides simples. 

28. M^thode selon I'une quelconque des revendicatlons 1 d 24, ou les amorces vers I'avant sont fournies sous la 
fonne d'un melange d'oiigonucl^otides. 

40 27. M6thode selon I'une quelconque des revendicatlons 1 d 24, ou les amorces vers {'arridre sont fournies sous la 
forme d'un melange d'ollgonucl^otides. 

28. l\/16thode selon I'une quelconque des revendicatlons 1 d 27, ou cliaque amorce contient une sequence codant un 
site de reconnaissance d'une enzyme de restriction. 

45 

29. M^thode selon la revendication 28, ou le site de reconnaissance d'une enzyme de restriction est plac6 dans la 
sequence qui est reculte d I'aclde nucl^ique. 

30. {^ethode selon la revendication 1, ou I'amorce vers Tarri^re est une amorce g6n6rale utile pour doner une sp6ci- 
50 ficite d'anticorps souhait^ d'une esp^ce sp^cifique. 

31 . t^ethode selon la revendication 1 , ou I'amorce vers I'arri^re est un melange d'amorces ayant une variety de se- 
quences designees pour dtre compl^mentaires aux diverses families des sequences VH, Vk ou V. 

55 32. Bibliotlieque d'expression comprenant un repertoire d'acides nucieiques pour {'expression d'un repertoire de pro- 
teines, chacune comprenant un domaine variable d'immunoglobuline. 

33. Bibliotheque d'expression comprenant un repertoire de troisiemes sequences de CDR, lesdites sequences etant 
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plac^es dans un g^ne VH autrement invariant. 
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Rearranged heavy chain vartabte gene (DNA) 



i * 

DH 

JH 


] — 

1 


oromoter 
40cl«mc>cotidemotin 


VHIBACK 

HUVHBACK 

MOVHBACK 


VHIFOR 
VHIFOR-2 
HUJHFOR 
MOJHFOR 


Unrearranged heavy chain variable gene (DMA) 

I — o 


■co-c 


VH 


]• 


promoUr 
(Oct«nuc1eottde motiO 


HUHEPIFOR 


HUOCTABACKI 
Rearranged heavy cham variable gene CmRNA) 

^ III VH |dh|jh| CHI |h| CH2 I CH3 | 

VHIBACK HUVHFOR 

HUVHBACK 

MOVHBACK 


Rearranged light chain variable gene (DNA) 

I O (ZI E 


promftttr 


VK 


VKIBACK 
VK2BACK 
nOVKBACK 


VKIFOR 

VK2F0R 

VK3F0R 

MOJKFOR 

HUJKFOR 


FIG. 1 


33 


EP 0 368 684 B2 


Psd 

VHIBACK ^ 

^ VHIFOR 

BstEn 


Hindm PstI BstEn BamHI 
rniTTPimIT i — m ^^^^ i PniTT 

P L VDJ 


y IgmRNA 3' 



M13-VHPCR1 


Hindm PstI BstEn BamHI 
PvuIIPvuIIi ^ M J L ^ Pv"P 


rrT 


P L VDJ 


M13-VHIg 


FIG. 2 


Pvun 


VKIBACK 


VKIFOR 
Bgin 


S' IgmRNA 3' 



Hindin PvuIIBcII BamHI 
PL VJ 


M13-VKPCR1 


Hindin Pvun BamHI 

J 


PL VJ 


M13-VKIg 


FIG. 4 
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M13 VBPCM, 

HinD III'" 

10 20 30 40 50 60 

70 80 90 100 110 120 


MGWSCIILFLVATAT 
CATGGCyVTOGAGCTGTATX:ATCCTCriTCTTGGTAGC^ 

130 140 150 160 170 IBO 


AGTAGCAGGCTTGAGGTCTGGACATATATATGGGTGAC^ 

190 200 210 220 230 240 

PstI 

1 5( 10 

GVHSQVQLQESGPGLVRP 
- TCTCCACAGGTGTCCACTCCCAGGTCCAACIGCB^iSA^^ 

250 260 270 280 290 206 

CDRl 

15 20 25 30 

SQTLSLTCTVSGSTFSSy»'M 

ctagccagac<xtgagcctgaccixx:accgtgtctggcagcacctt^^ 

310 320 330 340 350 360 

CDR2 

35 40 45 50 

i/wvrqppgrglewigi?idp/j 
tgcactgggtgagacagccacctggacgaggtcttgagtcgat^^ 

370 360 390 400 410 . 420 

55 60 65 70 

SGGTKYNEKFKSKVTHLVDT 
ATAGTGGTGGTACTAAGTACAATGAGIUlGTTCAAGftGCAGAGT^^ 

430 440 450 460 470 480 

75 80 85 90 

SKNQFSLRLSSVTAADTAVY 
CCAGCAA6AACX:AGTICAGCCTGAGACTCAGCAGe^ 

490 500 510 520 530 540 

CDR3 

95 100 105 110 

YCARyDryGSSrFDrWGQGT 
ATTATTGTGCAAGATACGATTACTACGGTAGTAGCTACTTIX^CI^ 

550 560 570 580 590 60b 

BstEII 
115 I 120 
T V T V S $ 
CCAQQGICACCSTCTCCTCAGGTGAGTrc 

610 620 630 640 650 660 

AGATTTTACTGCATTTCITGGGGGGGAAATGTGTGTATCT^ 

670 680 690 700 710 720 

CTAGGGACACCTTGGGAGTCAGAAAGGGTCATTGGGAGCCCGGGCTGAl^^ 

730 740 750 760 770 780 

BanHI 
I 

rcCTOU^CTCCTAGACTTCATGGCCAGAGATmT^ ^ I O 

790 800 810 r I W. J 
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M13 VfcPCRl 

HinD III 

AA(y.TTA TGAATATGCAAATCCTCTGARTCTACATGGT^ 

38 48 5B 68 78 88 


98 108 118 128 138 148 


MGWSCIILFLVATAT 
CATGGGATGGftGCTGTATCATCCTCrriCTTGGTASCAAC^^ 

158 16B 178 188 198 208 


AGTAGCAGGCTTGAGGTCTGGACATATATAOX^GGTGACAATGACATC^ 

218 228 238 248 258 268 


Pvu II 
I 

15 10 
GVHSDZQhTQSPSSLSAS 
TCTCX»CAG6TGTCCACICCGACATC£&S£ZGA0CCAG^ 

278 288 298 308 318 328 


CDRl 

15 20 25 30 

WGDKVTITCRASGNIHNYLA 
GOKXMGTGACAGAGTGACCATCACCTCTAGAGtXAGC^^ 

338 348 356 368 378 388 

CDR2 

35 40 45 50 

tfrQQKPGKAPKLtiYrrrrL 

CTT<3GTA0CAGCAGAAGCXAGG1AAGGCTCCAAAGC^^ 

398 408 418 428 438 448 


55 60 65 70 

ADGVPSR FSGSGSGTDFTFT 
TGGCTGACGGTGTGCCAAGCAGATTCAGCGGTAGCGGTAGCGGTACr 

458 468 478 488 498 508 

CDR3 

75 80 85 90 

ISSLQPEDIATYYC<?Hr(fSr 
CCAICAGCAGCXnXXAGCCAGAGGACATaXCACXnACT^^ 

518 528 538 548 558 568 


Bel I (requires dam~ host) 
I 

95 100 105 108 

PRrrCQGTKVVIKR 
CCCCAAGGACGTTCGGCCAAGGGACCAAGGTGGIGAXC&AACGl^^ 

578 588 598 608 618 628 


BamHI 
I 

638 648 r\\J. O 
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Sequence of MBrl VH 

Splice --1 
46 V H S 
AGGTGTCXaCTCC 

1 PstI 10 20. 

Q V Q L Q E SGTELASPGASVTL 
CAGGTCCAACTGCAGGAGTCAGG ARCTGRGCTGGa^^ 
VHIBACK SITE 

30 CDRl 40 

SCKAS'GYTP T ip H I I Nl W V K K R 
TCCTGCAAGGCTTCTGGCTACACATTTACrcyVCCATATTATAAATTGGGTJ^^ 

52a 53 CDR2 

PGQGLEWIG IR I Y P V S G V T N Yl 
CCTGGACAGGGCCTTGAGTGGATTGGAAGGATTTATCXA^ 
60 CDRi 65 70 

I N O K F M Gl KATFSVDRSSNTVy. 
AATCAAAAATTCAIGGGCAAGGCCACATTCTCTGTAGACCGGTCC^ 
80 82A B C 83 90 ,CDR3 

MVLNSLTSEDPAVYYCGR |G Fl 
ATGGTGTTG2ACAGTCrGACATCrGAGGACCCTGCTGTCTATO 

CPR3 1 03 BstEIl Splice 

1 D F D Yl W G Q G T T V T V S S i 

GATTTTGACTaC TGGGGCCAAGGGACCftCGGTCaCC^^ 

VHIFOR SITE 


Sequence of MBrl VK 

Splice -1 
4 G V H S 
AGGTGTCCACTCC 

1 PvuII 10 20 

D I Q L TQSPPSLXVSVGERVT 
GACATTCAGCTGACCCAGTCTCCAC CATCCCTGACTGTGTCAGTaflGAGA 
VKIBACK SITE 

, 27A B C D E - F CDRl 

I S ClKSNONLLWSGNRRYC L GI 
ATCAGTTGCAAATCCAATCAGAATCTTTTATGGAGTGGAAAC^ 
35 40 50 CDR2 

WHQWKP GQTPTPLIT IW T S D Rt 
TGGCACCAGTGGAAACCAGGGCAAACTCCTACACCGTTGATCACCTGGACATCTGATA^ 

, . 60 70 

I F Si GVPDRFIGSGSVTDFTLT 
TTCTCTGGAGTCCCTGATCGTTTCATAGGCAGTGGATCTGTGAC^GATTTCACTCT 

80 90 CDR3 

.1 SSVQAEDVAVYFCQ i Q H L D L| 
ATCAGCAGTGTGCAGGCTGAAGATGTGGCAGTTrATTTCTGTCAGCAACATTTGG?^ 

95 100 Bglll/BcH Splice 

I P Y Tl F G G G T K L E I K ^ 
CX:GTAavCGTTCGGAGG GGGGACCAAGCTGGAGATCRAACG TGA^ 
VKIFOR SITE 

FIG. 6 
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FIG. 7 
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a-Lys 17 


HindUI 



BamHI 


EcoRI- 


aLys-17 
12.27 kb 


Kappa enh 


1.92 



1.7 


SacI 


(BamHI) 


3 EcoRI sites internal 
to kappa fragment 


2.9 


FIG. 8 


pSV-gpt Hindni Psd BstEn BamHI BamHI 

• i >Bi z ZH — [ZH-D-IZZHZZH- 

E P L VDJ Cv 


FIG. 9 


pSV-hyg 


Hindm Pvun BamHI 

J — c 



Ck 
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FRl 


CPR 1 


KABAT lA 

A07 PGLVKPSQSLSLTCSVTGYSIT SGYYWN WIRQTPGNKLEWMG YISYD6SNNYNPSLKN 

A09 PGLVKPSQSLTLTCSITGFPIT SGYYHI WIRQSPCKPLEHMG YITHSGETTYNPSLQS 

E03 PGLVKPSQSLSLTCSVTGYSIT SGYYWN WIRQFPGNKIEWMS YISYDGSNNYNPSLKN 

GOX PGLVKPSQSLSLTCSVTGYSIT SGYYWN MIRQFPGNKLEWMG YISYDGSNNYNPSLKN 

KABAT IB 

A06 PVLVAPSQSLSITCAVSDFSLT NYGVL WVRQPPGKGLEWLG VIWAGGITNYNSALMS 

25G07 PGLVQPSQSLSITCrVSGFSLT SYGVH WVRQSPGKGIZWLG VIWSGGSTDYNAAFIS 

B03 PGLVAPSQSLSITCrVSGFSLT SYGVD VfVROTPGKGLEWLG VIKGGGSTNYNSALMS 

G03 PGLVQPSQSLSITCrVSGFSLT SYGVH WVRQSPGKGLEWLG VIWSGGSTDYNAAFIS 

H09 PVLVAPPQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEWLG VIWACGSTWNSALMS 

25C10 PGLVAPSQSLSITCTVSGFSLT SYAIS WVRQPPGKGLEWLG VIWTGGGTNYNSALKS 

A12 PGLVAPSQSLSITCTVSGFSLT SYAIS WVRQPPGKGLEWLG VIWTQGGTHYNSALKS 

AOS PGLVAPSQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEW** **»**CSTm«ALICS 

25G08 PGLVAPSQSLSITCTVSGFSLT SYDVD WVRQSPGKGLEWLG VIWGGGSTNYNSALKS 

A03 PGLVQPSQSLSITCrVSGFSLT SYGVH WVRQSPGKGLEWLG VIWSGGSTDYNAAFIS 

C07 PVLVAPSQSLSITCTVSGFSLT SYGVH WVRQPPGKGLEWLG VIWAGGSTNYNSALMS 

H04 PGLVAPSQSLSITCTVSGFSLT SYGVD WVRQSPGKGLEWLG VIWGVGSTNYNSALKS 


KABAT lU 


£04 

PELVRPGV5VXISCKGSGYTFT 

DYAMH 

WVKQSHAKSLEWIG 

H07 

PELVRPGVSVKISCKGSGYTFT 

DYAMH 

WVKQSHAKSLEWIG 

KABAT IIB 



A02 

AELVMPGASVKLSCKASGYTFT 

SYWMH 

WVKQRPGQGLEWIG 

B04 

AELVKPGASVKMSCKASGYTFT 

SYWIT 

WVKQRPGQGLEWIG 

COS 

AELVKPGASVKLSCKASGYTFT 

STWnn 


C09 

AELVKP6ASLKLSCKASGYTFT 

SYWMH 

WVKQRPGQGLEWIG 

006 

ASLVKPGASVKMSCKASGYTFT 

SYWIT 

WVKQRPGQGLEWIG 

DOB 

PELVKPGASVKLSCKASGYTFT 

SYWMH 

WVKQRPGQGLEWIG 

E07 

AELVRPGASVKLSCKASGYTFT 

DYEMH 

WVKQTPVHGLEWIG 

G08 

PELVKPGASVKI SCKASGYTFT 

DYYIN 

WVKC^GQGLEWIG 

GIO 

AELVKP6ASVKVSCKASGYTFT 

SYWMH 

WVKQRPGQGLEWIG 

25G09 

AELVKPGASVKMSCKASGYTFT 

TYPIE 

WVKQJIHGKSLEWIG 

F04 

TCLVKPGASVKLSCKASGYTFT 

SYWMH 

WVKC^IPGQGLEWIG 

H02 

AELVKPGASVKLSCKASGYTFT 

SYWMB 

WVKQRPGQGLEWIG 

HOX 

AELVMPGASVKLSCKASGYTFT 

SYW1H 

WVKQRPGQGLEWIG 

25C05 

PELVRPGTSVKMSCKASGYTFF 

NYWMK 

WV*ORPCC?CLEWIG 

BOX 

AELVKPGASVKMSCKASGYTFT 

SYWIT 

WVKQRPGQGLEWIG 

BOS 

AELVRPGSSVKLSCKDSYFAFM 

RKAMH 

NVKQRPGHGLEWIG 

BIX 

AELVKPGASVKMSCKASGYTFT 

SYWIT 

WVKQRPGQGLEWIG 

KABAT in A 



25G0S 

GGLVQAWGSLSLSCAASGFTFT 

OYYMS 

WVRQPPGKALEWLG 

CIO 

G6LVQPGGSLSLSCAASGFTFT 

DYYMN 

WVRQPPGKAI£WLA 

B07 

GGLVQPGG5LSLSCAASGFTFT 

DYYM5 

WVRQPPGKAIEHU 

KABAT tit B 



G05 

GGLVKPGGSLKLSCAASGFTFS 

DYGMH 

WVRQAPEKGLEWVA 

812 

GGLVQPGESLKLSCESNEYEFP 

SKDMS 

WVR*»*****'*VA 

D04 

GGLVQPGGSLRLSCAASGFTFS 

SYAMS 

W/k*APGKGlEMVS 

DOS 

GGLVQPGGSLRLSCAASGFTFS 

SYAMS 

VNk»APGKGLEUVS 

F12 

GGLVQPGESWKLSCVIQQ ' * » * 

*tf 

WRQ^PEKRLELVA 

F06 

GGLVQPGGSLRLSCAASGFTFS 

SYAMS 

WPi^APGKGLOiVS 

D02 

GGLVQPGESLKLSCESNEYVIP 


WVR^SGE*I£LVA 

F09 

GOLVKPGGSLKLSCAASGFTFS 

SYGMS 

WVRQTPOKRLEWVA 

KABAT in C 



E06 

GGLVQPGGSMKLSCAASGFTFS 

OAWMD 

WVRQSPEKGLEWVA 

KABAT V A 



C04 

AELVKPGASVKLSCKASGYTFT 

EYTIH 

WVKQRSGQGLEHXG 


VISTYYCa)ASYNQKFKD 
VISTYYOTASYNQKFKD 


EIDPSDSYTNYNCKFKG 
DIYPGSGSTNYNEKFKS 
RIOPNSGGTKYNEKFKS 
EINPSNGGTNYDEKFKS 
DIYPGSGSTNYNEKFKS 
EINPSNGGTNYNEKFKS 
AIDPETGGTAYNQKTKG 
WIYPGSGNTKYNEKFKG 
RIHPSDSDTNYNCICFKG 
NFHPYNDDTKYNEKFKG 
HINPSNG6TNYNCSCFK6 
NIDPSDSETHYNQKFKD 
EIDPSDSYTNYN*KVOG 
QIFPASGSIYYNEHHKD 
DIYPGSGSTNYNEKFKS 
SFTMYSDATEYSENFKG 
DIYPGSGSTNYNEKFKS 


FIRNKANGYTTEYSASVKG 
LIRHKANGYTMEYSASVKG 
LIRNKANGYTTEYSASVKG 


YISSGSSTIYYADTVKG 
Alt^DGGSTYYPDT}^ 
AISGSGGSTYYADSVKG 
AISGSOGSTYYADSVKG 
AINSDGGSTYYPDTHER 
AISGSGGSTYYADSAKG 
AINSDCGSTTYPDimR 
TISSG6SYTYYPDSVRG 


EIRNKANNHATYYAESVKG 


WFYPGSGSIKYNCKFKD 


FIG. 10a 
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FR 3 row % 


RISITRDTSKNQFFLKLNSVTTEDTATYYCAR EGNWDGFAY 

PISITRETSKNQFFLQLNSVTTEDTAMYYCAG DRDKLGPWFAY 

RISITRDTSKNOFFLQLNSVTTEDTATYYCAR DSSGSM3Y 

RISITRDTSKNQFFLKLNSVTTEDTATYYCAR VSSGYESMDY 


RLSISKOTSKSQVFLKMNSLQTDDTAVYYCAK 
RLSISKDNSKSQVFFKMNSLOADDTAIYYCAR 
RLSISKDNSKSCfVFLKTOlSLQTDDTAMYYCAK 
RLSISKDNSKSQVFFKMNSLQADDTAIYYCAR 
RLSISKDNSKSaVFU<MNSLOTDDTAMYYCAI 
RLSISKDNSKSQVFLKMHSLQTDD7ARYYCAR 
RLSISKDNSKSOVFLKMNSUrrDOTARYYCAR 
RLSlSHDNSKSQVFUQSiSLOfTDDTAMYYCM 
RLSISKDNSKSCVFLKMNSLQTDDTAMYYCAR 
RLSISKDNSKSQVFFKMNSLOWJDTAIYYCAR 
RLSISXDKSXSQVFLKMNSLQrDOTAMYYCAK 
RLSISKDNSKSOVFLRMNSLOTDDTAMYYCAS 


HGDSSGYFOY 

NDGYY 

LSRGYAMDY 

KRDYDYDRGYYYAMDY 

YYDGSFFAY 

EGYYYFAY 

IYYDGS5DYYAHDY 

13 nt. 

21 nt. 

28 nt. 

37 nt, 

32 nt. 


Ps .gene/Unproductlt 

Unproductive 

Unproductive 

Unproductive 

Unproductive 


KATMTVDKSSSTAYMELARLTSEDSAVyyCAR 
KATKTVOKSSSTAYMELARLTSEDSAVYYCAR 


40 nt. 
22 nt. 


Ur^roductive 
Unproductive 


KATLTVDKSSSTAYMQLSSLTSEOSAVYYCVR RGLTYA^OY 

KATLTVOTSSSTAYMQLSSLTSEOSAVYYCAR YYSNYFDY 

KATLTVDKPSSTAYMQLSSLTSEDSAVYYCAR FNWDHYYYGKDV 

KATLTVOKSSSTAYMQLSSLTSEDSAVYYCTL LYYYAMDY 

KATLTVDTSSSTAYMQLSSLTSEDSAVYYCAR SSGYDY 

KATLTVDKSSSTAYMOLSSLTSEDSAVYYCTI GAARATNftY 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCAR GGFAY 

KATLTVDTSSSTAYMOLSSLTSEDSAVYYCAR SPM)Y 

KATLTVDKSSSTAYMQLSSLTSSOSAVYYCAI EVPGGFYATDY 

KATLTVEKS5STVYLELSRLTS0DSAVYYCAR MDYYGSSLWFAY 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCAK TTWAFOY 

KATLTVDKSSSTAYMQLSSLTSEDSAVYYCAR KRDYSTYFDH 

KATLrVDKSSSTAYMSLSSLTSEDSAVYYCAP TGTEFAY 

KAAHAVDTSSSTAYMQLSSLTSEDTAVrFCL* 24 nt. 

KATLTVDKPSDTAYMOLSSLTSEDSASYYCAR 9 nt. 

KATLTANTSSSTAYMELSSLTSEDSAVYYCAR 23 nt. 

KATLTVDTSSSTSYMQLSSLTSEDSAVYYCAR 15 nt. 


Ps.gene 

Ps . gene / Unproductit 
Unproductive 
Unproductive 
Unproductive 


RFTISRONSQSILYUIMNALRAEDSATYYCAR YMILGAM)Y 
RFTISRDNSQSILYU3MNALRAEDSATYYCAR GYYYDGSYYAMDY 
RrriSRDNSOSILYLQMNAtRAEDSATYYCAR 23 nt. 


Unproductive 


RFTISBDKAKNTLFLQHTSLRSEDTAMYYCAR AKFHLYFDY 

RFIJSRDNTKKTLYUIMSSIRSEDTALYYCAR RESWESRWGDV 

HFTISRONSKNTLYLGMNSIMAEDTAVYYCAD RGLHUFDP 

RFTISRDNSKtmYLQMNSmAEDTAVYYCAK fWYGSSPFDY 

RFIlSRDNSKKTLYLQMSSLRSEU)TALYYCAli PPffffSY 

nrTISRDNSKNTLYLQMNSLRAEDTAVYYCAK 43 nt. 

RFJISRDNTKKTLYLQtiSSLRSEDTALYYCAR 28 nt. 

RFTISRDNAKNTLYLQMSSLKSEDTAMYYCAR 35 nt. 


Ps.gene 
Ps.gene 
Ps.gene 
Ps.gene 

Ps . ge ne / Unp roduct i^ 
Ps . gene/Unproduct i^ 
Unproductive 


RFTISRODSKSRVYLQMNSLRAEDTGIYYCTG 


30 nt. 


Unproductive 


KATLTAOKSSSTVYMELSRLTSEOSAVYFCAR HEORDSSGYAMDY 

FIG. 10 b 
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CPR 2 


rWtflEWORK 3 


COR 3 


KABAT BOMAH VBX 


STSTAYMELRSLRSEOTAWYCAR GEGWDHFDY 
HAQKFQG RVTIRRHKSTSTAYMELSSLRSEDTAVYYCAR GSRYGYDCSGYYyL 
GYAQKFQG HVTMTRNTSISTATMELSSLRSEDTAVYYCAR LAHrSGSPVDWFDP 


KABAT HOMAN V£2 


KHQIOPSLKS RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR GGWPAAIMDV 

KS RVTISVDTSKNQFSUCLSSVTAADTAVYYCAR MARYYDFWSGYSAYYDY 

SLKS RLSISQDTSRNQFSLRLSSVTAADTAVYYCAR HKMWGSPVHFOY 

ESTSTAYMELSSLBSEDTAVyYCAR DSYGDYG6HY 


2CABAT BUMAK VB3 


ISYITSSSSYTNYADSVKG 
SVKG 
YADSVKG 
YYADSVRD 
DSVKG 

VSAISGSGGSTYYADSVKG 
AVISYDGSNKYYADSVKG 
GAVISYDGSNKYYADSVKG 

QYAASVKG 

X2IOHK FAMILY 


RFTISRDNAKNSLYLQMNSLRADDTAVYYCAR 
RFTISRDDSKSIAYLQVNSLKTEDTAVYYCTR 
RFTISBDHAKHSLFLQMSSLRAEDTAFYYCAR 
RFTISRDMSKNTLYLQ^^SLRAEDTAVYYCAK 
RFTISRDNAKNSLYLQMNSLRDEDTAVYYCAR 
RFTISRDNPKNTLYLQMNSLRSEDTAVYYCAR 
RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAR 
RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAS 
AKNSLYLQMNSLRAEDTAVYYCVR 
RFTISRDDSKNSLYLQMNSLNTEDTAVYYCVR 


DGRFGTYSPSDY 
TIYYDSS6YPYW 
GXAU)ArOI 

53 NT. UNPBOD REARR 

DHSGTGGGGS6SYF 

KDNU^FDP 

DLGGRGWWPAPGGRSI Y YYGMDV 

LEGIGTIYYYGMDV 

DDS5SWPKHFQH 

SGWPYLDY 


AVYYCAR DPRIAARPDYYYYffDV 
TAMYYCAR GAEWEPTARYYYGLNV 


FIG. 11 
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YTFT SYGIS 

GEKPGSSVKVSCKASGYTFT DYFMN 

QVQLQEI6PRTGEASETLSLICAVSGDSIS SGNW*I 

QVQLQESGPGLVK* SETLSLTCTVSGGSIS SYYWS 

GYTFT NYCMH 

QVQLQESGPGLVKpSETLSLYCAVSGDS I S SGNW* I 

GPRLGEASETLSLTCTVSGGSIS SSSYYw 

QVQLQESGPGLVKpSETLSLTCTVSGGSIS SYYWS 

LSLICAVSGSSIS SGNW*I 

SETLSLTCAVYGGSFS GYYWS 

QVQLVQSGAEVKKPGASVKVSCKASGYTFT NYCMH 

SETLSLICAVSGDSIS SGNW*I 

SRAQTGEASETLSLTCTVSGGSI S SSSYYWG 

CPLTCTVSGGSVSSGS YYWS 

GLVKPSETLSLTCTVSGGSI S SYYWS 

SFETLSLICAVSGDSI S SGNW* I 

QVQLVQSGAEVKKPGSSVKVSCKASGGTFS SYAIS 

QVQ!LQQWGAGLLKPSETLSLTCAVYGGSFS GYYWS 

QLQLQESGPGLVKPSETLSLTCTVSGGSIS SSSYYWG 

CSGLVKPSQTLSLTCTVSGGSIS SGGYYWS 


WVTTCPWTRDLHWMG 
WMBQAPGQBLEWMG 
WVRQPPGKGLEWIG 
WIrqppGKGLEWIG 
WVRQDHAQGLEWMG 
WVRQPPGKGLEWIG 
WIRQPPGKGLEWIG 
WIRQPPGKGLEWIG 
WVRQPPGKGLEWIG 
WIRQPPGKGLEWIG 
WVRQVLAQGLEWMG i 
WVRQPPGKGLEWIG 
WIRlC^PGKGLEWIG 
WIRQPPGKGLEWIG 
WIGSPpGKGLEWIG 
WVRQPPGKGLEWIG 
WVRO?VPGQGLEWMG 
WIRQPPGKGLEWIG 
WIRQPPGKGLEWIG 
WIRQNPGKGLEWIG 


* indicates stop codon ( unsure as sequence remains in frame) 

• sequence termonates due to internal restriction site [ 
lower case denotes frame shift j 


[cDR2 


FR3 


CDR3 


WI S AYNGNTNYAQKLQG 
WINAGNGNTKYSQKLQG 
EIHHS6STYYNPSLKS 
RIYTSGSTNYli^SLKS 
LVCPSDGSTSYAQKFQA 
EIHHSGSTYYNPSLKS 
EINHSGSTNYNPSLKS 
YIYYSGSTNYNPSLKS 
EIHHSGSTYYNPSLKS 
EIHHSGSTNYNPSLKS 
LVCPSDGStSYAQKFQA 
EIHHSGSTYYNPSLKS 
SIYYSGSTYYNPSLKS 
YIYYSGSTNYNPSLKS 
RIYTSGSTNYNPSLKS 
EIHHSGSTYYNPSLKS 
RIIPILGIANYAQKFQG 
EINHSGSTNYNPSLKS 
EINHSGSTNYNPSLKS 
YIYYSGSTYYNPSLKS 


RVIWTTDTSTSTAYMELRSLRSDDTAVYYCAR DTVSS 
RVTITRDTSASTAYMQLSSLRSEDTAVYYCAR DTVSS 
RITMSVDTSKNQFYLKLSS» 

RVnSVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 

RVTITRDTSMSTAYMELSSLHSEDTAMYYCAR DTVSS 

RITMSVDTSKNQFYLKLSS* 

RVTISVDTSKNQFSLKLSS* 

RVTISVDTSKNQFSLKLSS* 

RITMSVDTSKNQFYLKLSS* 

RVnSVDTSKNQFSLKLSSVTRADTAVYYCAR DTVSS 
RVTITRDTSMSTAYMBLSSLRSEDTAMYYCAR DTVSS 
RITMSVDTSKNQFYLKLSS* 
RVTIPVDTSKNQFSLKLSS • 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 
RVTMSVDTSKNQFSLKLSS • 
RITMSVDTSKNQFYLKLSS • 

KVTITADKSTSTAYMELSSLRSEDTAVYYCAR DTVS 
RVTI SVDTSKNQFSLKLSS • 
RVTI SVDTSKNQFSLKLSS • 

RVTI SVDTSKNQFSLKLSSVTAADTAVYYCAR DTVSS 


FIG. 12 
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pSWl 

Hindlll site AAGCTT 

MKYLL.PTAA 
G«T<»UWVTTCTATTTCAAGGAGACAGTCATAATGAAA^^ 

10 20 30 40 50 60 


AGLLLLAAQPAMAQVQLQE S 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCaGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


GPGLVAPSQSLSITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCA^ 

130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLEW 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWGDGNTDYNSALKSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAA^ 

250 260 270 280 290 300 


SXSKDNSKSQVFLKMKSLHT 
AGCATCAGOUVGGACAACTCCAAGAGCCAAGTTTTCT 

310 320 330 340 350 360 


DDTARYYCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGG^ 
370 380 390 400 410 420 


QGTTVTVSS Snial 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTAT^mS^ 

430 440 450 460 470 480 


FIG. 13 
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PSW2 . . 
Hindlll AAGCTT 

MKYLLPTAA 
GCATGCAAArXCTATTTCAAGGAGACAGTCAXAATGAAATACCTATTGCCTACGGCA 

10 20 30 40 50 60 


AGLLLLAAQPAMAQVQLQE 'S 
GCTCXyvTTGTTATTACTCGCTGCCCAACaGCGATGGCCCAGGTGCAGCTGCAGGAG^ 
70 80 90 100 110 120 


GP G LVAPSQSLSITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLE.W 
TCATTAACCGGCTATGGT6TAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAAXCCA^ 

250 260 270 280 290 300 


SI SKDNSKSQVFLKMNSLH T 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCT^ 

310 320 330 340 350 360 


D DTARYYCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 380 390 400 410 420 


QGTTVTVSS 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTCGAATTCGCCAAGCTTGCATGC 
430 440 450 460 470 480 


MKYLLPTAAA 5 
AAATTCTATTTCAAGGAGACAGTCATAATGMATACCTATTGCCTACGGCAG^^ 

490 500 510 520 530 540 


LLLLAAQPA.MADIVLTQSP V 
TTGTTATTACrCGCTGCCCAACCAGCGATGGCCGAO^TCGTCCTGACTCAGTCTCCAGCC 
550 560 570 580 590 600 


SLSASVGETVTITCRASGNI 
• TCCCTTTCTGCGTCTGTGGGAGAAACTGTCACCATCACATGTCGAGCAAGTGGGAA 

610 620 630 640 650 660 


HNYLAWYQQKQGKSPQLL V.Y 
' CACAATTATTTAGCATGGTATCAGCAGAAACAGGGAAAATCTCCTCAGCTCXTGGTCT^^ 
670 680 690 700 710 720 

FIG, Ua 
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YTTTLADGVPSRFSGSGSGT 
TATACAACAACCTTAGCAGATGGTCTGCCATOAGGTTCAGTGGCAGTGffi 

730 740 750 760 770 780 


QYSLKINSLQPEDFGS YYCQ 
CAATATTCTCTOAGATCAAa^CTGCAACCTGAAGATTTO 

790 800 810 820 830 840 


HF'W3XPRXf*GGG7KI<E XKR 
CATTTTTGGAGTACTCCTCGGACGTTCGGTGGAGGCACOAGCT^^ 

850 860 870 880 890 900 


TAAGAGCTCGAATTC 
910 

FIG. K b 


pSWlHPOLYMYC 
Hindlll site AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCT^^ 

10 20 30 40 50 60 

AGLLLLAAQPAMAQVQLQ 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCOVGGTGCAGCTGCM 
70 80 90 100 110 PstI 

Polyl inker 
7CTA6A GTC6AC CTCGAG 
Xbal Sail Xhol 

MYC PEPTIDE 

V T V S S EOKLISEEDLM * * 
GCTCACC6TCTCCTCAGAACAAAAACTCATCTCAGAAGAGGA7CTGAATTAATAA 
BstEII 

GGGCTAAGCTCGAATTC 

FIG. 15 
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VH3 QVQLQESGPELVKPGASVKMSCKASGYTFT 
VH8 QVQLQESGPELVKPGASVKMSCKASGYTFT 
Va-Dl • 3 QVQLKESGPGLVAPSQSLSITCTVSGFSLT 


VB3 
VBd 

VH-D1.3 


50 


CDR2 


yiNPYNDGTKYNEKFKG 
YINPYNDGSKYNEKFKG 
MIW GDGNTDYNSALKS 


CDRl 


SYVMH 
SYVMH 
GYGVN 


WVKQKPGAGLEWIG 
WVKQKPGQGLEWIG 
WVRQPPGKGLEWLG 


♦ ^ + : J 

KATLTSDKSSSTAYMELSSLTSEDSAVYYCAV 
KATLTADKSSNTAYMQLSSLTSEDSAVYYCAR 
RLSISKDNSKSQVFLKMHSLHTDDTARYYCAR 


94 


VK3 
VH8 

VH-D1.3 


95 a>R3 


LLLRYFFDY 

GAWSYYAMDY 

ERDYRLDY 


113 

WGQGTTVTVSS 
WGQGTTVTVSS 
WGQGTTLTVSS 


FIG. 16 


FR1 

QVQLQESGGGLVQPGGSLRLSCAASGFTFS 



SYAMS 

CDR1 

FR2 

WVRQAPGKGLEWVS 



AISGSGGSTYYADSVKG 

CDR2 

FR3 

RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAM 



WRGIATPVSFDLGYFDY 

CDR3 


FIG. 17 
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PstI 


BstEII 


rearranged VH genes 
from immunised mouse 
spleen ONA 


1111 


pSWIHPOLYMYC 


PSti 


-c 



VHD1.3gene 

PGR amplify rearranged VH genes or 
VHD1.3. Exdse VH band from gel. 
Clone into vector for expression of VH 
doniains in E.coli 


pSWIHPOLY 


repertoire of expressed 
VH domains from spleen 


repertoire of expressed 
.VH domains with mutant 
CDR3 regions 


Assay for binding to antigen 


FIG. 18 
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PSW2HP0LY 
Hindlll AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACACTCArAATGAAATACCTATTGCCTACGG» 

10 20 30 40 50 60 


AGLLLL'AAQPAMAQVQLQ 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG 
70 80 90 100 - 110 PstI 


TCTA6A 6TCGAC CTCGAG 

Xbial Sail Xhol 

V T V S S 
GGTCACCGTCTCCTCATAATAAGAGCTCGAATTCGCCAAGCrTGCATGC 
BstEII 430 440 450 460 470 480 


MKYLLPTAAAG 
AAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAK^ 

490 500 510 520 530 S40 


LLLLAAQPAMAD IVLTQSP A 
TTGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGTCCT6ACTCA 

550 560 570 580 590 !600 


SLSASVGETVTITCRA SGNI 
TCCCTTTCTGCGTCTGTGGGAGAAACTGTCJVCCATCACATGTCGAGCAAGTGGGAATATr 
610 620 630 640 650 .660 


HN YLAWYQQKQ G K SPQLLVY 
aUAATTATTTAGCATGGTATCAGCAGAAACAGGGAAAATCTCCTCAGCTCCTGGTCTA^ 
670 680 690 700 710 720 


YTT TLADGVPSRFSGSGS GT 
TATACAACAACCrTAGCAGATGGTGTGCCATCAAGGTTCAGTGGa\GTG^ 

730 740 750 760 770 780 


QYSLKINSLQPEDFGSYYCQ 
CAATATTCTCTCAAGATCAACAGCCTGCAACCTGAAGATTTT^^ CAA 
790 800 810 820 830 840 


HFWSTPRTFGGGTKLEIKR 
aVTTTTTGGAGTACTCCTCGGACGTTCGGTGGAGGCACCAAGCTGGAAArCAAACG^T^ 
850 860 870 880 890 900 


TAAGAGCTCGAATTC 
910 

FIG. 19 
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M K Y L L P T 
AAGCTTGCATGO^TTCTATTTCAAGGAGACAGTCATA^ 

10 20 30 40 50 60 

AAAGL LLLAAQPAMAQVQLQ 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCXCAGGTGCAGCT^ 
70 80 90 100 110 120 

ESGPGLVAPSQSLSITCTVS 
GAGTCAGGACCTGGCCTGGTGGCGCCCTCACAGAGCCrGTCCATCACATGC^^ 

130 140 150 160 170 180 

-GFSLTGYGVNWVRQPP GKGL 
GGGTTCTCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTG 
190 200 210 220 230 240 

EWLGMIWGDGNTDYNSALKS 
GAGTGGCTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTC^^ 

250 260 270 280 290 300 

RLSISKDNSKSQVFLKMNSL 
AGACTGAGCATCAGCAAGGACAACTCCy^GAGCCAAGTTTTCTTAAA^ 

310 320 330 340 350 360 

HTD DTARYYCARERDYRLDY 
CACACTGATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTAC 
370 380 390 400 410 420 

WGQ GTT VTVSSGGGAPAAAP 
TGGGGCCAAGGCACCACGGTCACCGTCTCCTCAGGTGGTGGT^TCCAG 

430 440 450 460 470 480 

AGGGQVQLKESGPGLVAPSQ 
GCTGGAGGAGGACAGGTGCAGCTGAAGGAGTCAGGACCTGGCCTGGTGGCGCCCTCACAG 
490 500 510 520 530 540 

SLSITCTVSGFSLTGYGVNW 
AGCCTGTCCATCACATGCACCGTCTCAGGGTTCTCATTAACCGGCTATGGTGTAAACTGG 
550 560 570 580 590 600 

VRQPPGKGLEWLGMIWGDGN 
GTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGGCTGGGAATGATTTGGGGTGATGGAAAC 
610 620 630 640 650 660 

T D YNSALKSRLSISKDNSKS 
ACAGACTATAATTCAGCTCTCAAATCCAGACTGAGCATCAGCAAGGA^ 

670 680 690 700 710 720 

QVFLKMNSLHTDDTARYYCA 
CAAGTTTTCTTAAAAATGAACAGTCTGCACACTGATGACACAGC» 

730 740 750 760 770 780 

RERDYRLDYWGQGTTVTVSS 
AGAGAGAGAGATTATAGGCTTGACTACTGGGGCCAAGGCACCACGGTCACCGTCTCCTC^ 
790 800 810 820 830 840 

* * 

TAATAAGAGCTC 
850 

FIG. 20 
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MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTO^TAATGAAATACCTATTGCCTACTC 

10 20 30 40 50 1 60 


AGLLLLAAQPAMAQVQLQES 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 120 


GPGLVAPSQSLSITCTVSiSF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 170 180 


S LTG YGVN WVRQPPGKGLEW 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWGDGNTDY. NSALKSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATCCAGACTG 
250 260 270 280 290 300 


SISKDNSKSQVFLKMNSLHT 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACT 
310 320 330 340 350 360 


DDTARyyCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 380 390 400 410 . 420 


■QGTTVTVSSRTPEMPVLENR 
CAAGGCACCACGGTCACCGTCTCCTCACGGACACCAGAAATGCCTGTTCTGGAAAACCGG 
430 440 450 460 470 480 


AAQGDITAPGGARRLTGDQT 
GCTGCTCAGGGCGATATTACTGCACCCGGCGGTGCTCGCCGTTTAACGGGTGATCAGACT 
490 500 510 520 530 540 


AALRDSLSDKPAKNIILL IG 
GCCGCTCTGCGTGATTCTCTTAGCGATAAACCTGCAAAAAATATTATTTTGCTGATTGGC 
550 560 570 580 590 600 


DGMGDSEITAARNYAEGAGG 
GATGGGATGGGGGACTCGGAAATTACTGCCGCACGTAATTATGCCGAAGGTGCGGGCGGC 
610 620 630 640 650 ' 660 


FFKGIDALPLTGQYTHYA LN 
TTTTTTAAAGGTATAGATGCCTTACCGCTTACCGGGCAATACACTCACTATGCGCTGAAT 
670 680 690 700 710 720 


KKTGKPDYVTDSAASATAWS 
AAAAAAACCGGCAAACCGGACTACGTCACCGACTCGGCTGCATCAGCAACCGCCT6GTCA 
730 740 750 760 770 780 

FiaZla 
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TGVKTYNGALGVDIHEKDHP 
ACCGGTGTCAAAACCTATAACGGCGCGCTGGGCGTCGATATTCACGAAAAAGATCACCCA 
790 800 810 820 830 840 


TILEMAKAAGLATGNVSTAE 
ACGATTCTGGAAATGGCAAAAGCCGCAGGTCTGGCGACCGGTAACGTTTCTACCGCAGAG 
850 860 870 880 890 900 


LQDATPAALVAHVrSRKCYG 
TTGCAGGATGCavCGCCCGCTGCGCTGGTGGCACATGTGACCTCGCGCAAATGCTACGGT 
910 920 930 940 950 960 


PSATSEKCPGNALEKGGKGS 
CCGAGCGCGACCAGTGAAAAATGTCCGGCTAACGCTCTGGAAAAAGGCGGAAAAGGATCG 
970 980 990 1000 1010 1020 


ITEQLLNARADVTLGGGAKT 
ATTACCGAACAGCTGCTTAACGCTCGTGCCGACGTTACGCTTGGCGGCGGCGCAAAAACC 
1030 1040 1050 1060 1070 1080 


FAETATAGSWQGKTLREQAQ 
TTTGCTGAAACGGCAACCGCTGGTGAATGGCAGGGAAAAACGCTGCGTGAACAGGCACAG 
1090 1100 1110 1120 1130 1140 


ARGYQLVSDAASLNSVTEAN 
GCGCGTGGTTATCAGTTGGTGAGCGATGCTGCCTCACTGAATTCGGTGACGGAAGCGAAT 
1150 1160 1170 1180 1190 1200 


QQKPLLGLFADGNMP VRWLG 
CAGCAAAAACCCCTGCTTGGCCTGTTTGCTGACGGCAATATGCCAGTGCGCTGGCTAGGA 
1210 1220 1230 1240 1250 1260 


P'KATYH GNIDKPAVTCTPNP 
CCGAAAGCAACGTACCATGGCAATATCGATAAGCCCGCAGTCACCTGTACGCCAAATCCG 
1270 1280 1290 1300 1310 1320 


QRNDSVPTJjAQMTDKAIELL 
CAACGTAATGA«GTGTACaACCCTGGCGCAGATGACCGACAAAGCCATTGAATTGTT^ 
1330 1340 1350 1360 1370 1380 


SK NEKGFFLQVEGASIDKQD 
A6TAAAAATCAGAAAGGCTTTTTCCTGCAAGTTGAAGGTGC6TCAATCGATAAACAGGAT 
1390 1400 1410 1420 1430 1440 


HAANPCGQIGETVDLDEAVQ 
CATGCTGCGAATCCTTGTGGGCAAATTGGCGAGACGGTCGATCTCGATGAAGCCGTACAA 
1450 1460 1470 1480 1490 1500 


RALEFAKKEGNTLVIVTADH 
CGGGCGCTGGAATTCGCTAAAAAGGAGGGTAACACGCTGGTCATAGTCACCGCTGATCAC 
1510 1520 1530 1540 1550 1560 

FIG. 21b 


52 


EP 0 368 684 B2 


AHASQIVAPDTKAPGLTQAL 
GCCCACGCCAGCCAGATTGTTGCGCCGGATACCAAAGCTCCGGGCCTCACCCAGGCGCTA 
1570 1580 1590 1600 1610 1620 


NTKDGAVMVMSYGNSEEDSQ 
AATACCAAAGATGGCGCAGTGATGGTGATGAGTTACGGGAACTCCGAAGAGGATTCACAA 
1630 1640 1650 1660 1670 :1680 


EHTGSQLRIAAYGPHAANyV 
GAACATACCGGCAGTCAGTTGCGTATTGCGGCGTATGGCCCGCATGCCGCCAATGTTGTT 
1690 1700 1710 1720 1730 1740 


GLTDQTDLFyTMKAALGLK* 
GGACTGACCGACCAGACCGATCTCTTCTACACCATGAAAGCCGCTCTGGGGCTGAAATAA 
1750 1760 1770 1780 1790 1800 


AACCGCGCCCGGGAGTGAATTTTCGCTGCCGGGTGGTTTTTTTGCTGTTAGC 
1810 1820 1830 1840 1850 


F(G. 21c 
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MK YLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 40 50 60 


AGLLLLAAQPAMAQVQLQES 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG6AGTCA 
70 80 90 100 110 120 


GPGLVAPSQSLSITCT VSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTO 
130 140 150 160 170 180 


SLTGYGVNWVRQPPGKGLEW 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGG 
190 200 210 220 230 240 


LGMIWGDGNTDYNSALKSRL 
CTGGGAATGATTTGGGGTGATGGAAACACAGACTATAATTCAGCTCTCAAATCCAGACTG 
250 260 270 280 290 300 


SISKDNSKSQVFLKMNSLHT 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACT 
310 320 330 340 350 360 


DDTARYYCARERDYRLDYWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 
370 ' 380 390 400 410 420 


QGTTVTVSS** 
CAAGGCACCACGGTCACCGTCTCCTCATAATAAGAGCTATCOCGGGAGCTTGCATGCAAA 
430 440 450 460 470 480 


MKYLLPTAAAGL 
TTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTG 
490 500 510 520 530 540 


LLLAAQPAMADIELVDLEIK 
TTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGAGCTCGTCGACCTCGAGATCAAA 
550 560 570 580 590 600 


REQKLISEEDLN** 
CGGGAACAAAAACTCATCTCAGAAGAGGATCTGAATTAATAATGATCAAACGGTAATAA6 
610 620 630 640 650 660 


GATCCAGCTCGAATTC 
670 

FIG. 22 


54 


EP 0 368 684 B2 


A 

QVQLQESGPGLVQPSQ SLSI 
CAGGTGCAGCTGCAGGAGTCAGGACCTGGCCTAGTGCAGCCCTC^ 

10 20 30 40 50 60 

G N P 

TCTVSGFSLTSYGVHWVRQS 
ACCTGCACAGTCTCTGGTTTCTCATTAACTAGCTATGGTGTACACTGGGT^^ 

C 

70 80 90 100 110 120 


PGKGLEWLGM IWGDGMTDYN 
CCAGGAAAGGGTCTGGAGTGGCTGGGAATGATTTGGGGTGATGGAA^ 

130 140 150 160 170 180 


SALKSRLSISKDNSKSQVFL 
TCAGCTCTCAAATCCAGACTGAGCATOWXIAAGGACAACTCC^ 

190 200 210 220 230 240 


KMNSLHTDDTARYYCARER D 
AAAATGAACAGTCTGCACACTGAl^CACAGCCAGGTACTACTGTGCCAGAGAG^ 

250 260 270 280 290 300 


YRLDYWGQGTTVTVSS 
TATAGGCTTGACTACTGGGGCCAAGGGACCACGGTCACCGTCTCCTCA 
310 320 330 340 

FIG. 23 
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